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COMPOSITIONS AND METHODS FOR THE DIAGNOSIS AND TREATMENT OF TUMOR 

FIELD OP THE INVENTION 
The present invention is directed to compositions of matter useful for the diagnosis and treatment of 
tumor in mammals and to methods of using those compositions of matter for the same. 

BACKGROUND OF THE INVENTTON 
Malignant tumors (cancers) are the second leading cause of death in the United States, after heart 
disease (Boring et al., CA Cancel J. Clin. 43:7 (1993)). Cancer is characterized by the increase in the number 
of abnormal, or neoplastic, cells derived from a normal tissue which proliferate to form a tumor mass, the 
invasion of adjacent tissues by these neoplastic tumor cells, and the generation of malignant cells which 
eventually spread via the blood or lymphatic system to regional lymph nodes and to distant sites via a process 
called metastasis. In a cancerous state, a cell proliferates under conditions in which normal cells would not 
grow. Cancer manifests itself in a wide variety of forms, characterized by different degrees of invasiveness 
and aggressiveness. 

In attempts to discover effective cellular targets for cancer diagnosis and therapy, researchers have 
sought to identify transmembrane or otherwise membrane-associated polypeptides that are specifically expressed 
on the surface of one or more particular type(s) of cancer cell as compared to on one or more normal non- 
cancerous cell(s). Often, such membrane-associated polypeptides are more abundantly expressed on the surface 
of the cancer cells as compared to on the surface of the non-cancerous cells. The identification of such tumor- 
associated cell surface antigen polypeptides has given rise to the ability to specifically target cancer cells for 
destruction via antibody-based therapies. In this regard, it is noted that antibody-based therapy has proved very 
effective in the treatment of certain cancers. For example, HERCEPTIN® and RITUXAN® (both from 
Genentech Inc. , South San Francisco, California) are antibodies that have been used successfully to treat breast 
cancer and non-Hodgkin's lymphoma, respectively. More specifically, HERCEPTIN® is a recombinant 
DNA-derived humanized monoclonal antibody that selectively binds to the extracellular domain of the human 
epidermal growth factor receptor 2 (HER2) proto-oncogene. HER2 protein overexpression is observed in 
25-30% of primary breast cancers. RITUXAN® is a genetically engineered chimeric murine/human 
monoclonal antibody directed against the CD20 antigen found on the surface of normal and malignant B 
lymphocytes. Both these antibodies are recombinantly produced in CHO cells. 

In other attempts to discover effective cellular targets for cancer diagnosis and therapy, researchers 
have sought to identify (1) non-membrane-associated polypeptides that are specifically produced by one or more 
particular type(s) of cancer cell(s) as compared to by one or more particular type(s) of non-cancerous normal 
cell(s), (2) polypeptides that are produced by cancer cells at an expression level that is significantly higher than 
that of one or more normal non-cancerous cell(s), or (3) polypeptides whose expression is specifically limited 
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to only a single (or very limited number of different) tissue type(s) in both the cancerous and non-cancerous state 
(e.g. , normal prostate and prostate tumor tissue). Such polypeptides may remain intracellularly located or may 
be secreted by the cancer cell. Moreover, such polypeptides may be expressed not by the cancer cell itself, but 
rather by cells which produce and/or secrete polypeptides having a potentiating or growth-enhancing effect on 
cancer cells. Such secreted polypeptides are often proteins that provide cancer cells with a growth advantage 
5 over normal cells and include such things as, for example, angiogenic factors, cellular adhesion factors, growth 
factors, and the like. Identification of antagonists of such non-membrane associated polypeptides would be 
expected to serve as effective therapeutic agents for the treatment of such cancers. Furthermore, identification 
of the expression pattern of such polypeptides would be useful for the diagnosis of particular cancers in 
mammals. 

1 0 Despite the above identified advances in mammalian cancer therapy, there is a great need for additional 

diagnostic and therapeutic agents capable of detecting the presence of tumor in a mammal and for effectively 
inhibiting neoplastic cell growth, respectively. Accordingly, it is an objective of the present invention to 
identify: (1) cell membrane-associated polypeptides that are more abundantly expressed on one or more type(s) 
of cancer cell(s) as compared to on normal cells or on other different cancer cells, (2) non-membrane-associated 

1 5 polypeptides that are specifically produced by one or more particular type(s) of cancer cell(s) (or by other cells 
that produce polypeptides having a potentiating effect on the growth of cancer cells) as compared to by one or 
more particular type(s) of non-cancerous normal cell(s), (3) non-membrane-associated polypeptides that are 
produced by cancer cells at an expression level that is significantly higher than that of one or more normal non- 
cancerous cell(s), or (4) polypeptides whose expression is specifically limited to only a single (or very limited 

20 number of different) tissue type(s) in both a cancerous and non-cancerous state (e.g., normal prostate and 
prostate tumor tissue), and to use those polypeptides, and their encoding nucleic acids, to produce compositions 
of matter useful in the therapeutic treatment and diagnostic detection of cancer in mammals. It is also an 
objective of the present invention to identify cell membrane-associated, secreted or intracellular polypeptides 
whose expression is limited to a single or very limited number of tissues, and to use those polypeptides, and 

25 their encoding nucleic acids, to produce compositions of matter useful in the therapeutic treatment and diagnostic 
detection of cancer in mammals. 

SUMMARY OF THE INVENTION 

A. Embodiments 

30 111 toe present specification, Applicants describe for the first time the identification of various cellular 

polypeptides (and their encoding nucleic acids or fragments thereof) which are expressed to a greater degree 
on the surface of or by one or more types of cancer cell(s) as compared to on the surface of or by one or more 
types of normal non-cancer cells. Alternatively, such polypeptides are expressed by cells which produce and/or 
secrete polypeptides having a potentiating or growth-enhancing effect on cancer cells. Again alternatively, such 

35 polypeptides may not be overexpressed by tumor cells as compared to normal cells of the same tissue type, but 
rather may be specifically expressed by both tumor cells and normal cells of only a single or very limited 
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number of tissue types (preferably tissues which are not essential for life, e.g., prostate, etc.). All of the above 
polypeptides are herein referred to as Tumor-associated Antigenic Target polypeptides ("TAT" polypeptides) 
and are expected to serve as effective targets for cancer therapy and diagnosis in mammals. 

Accordingly, in one embodiment of the present invention, the invention provides an isolated nucleic 
5 acid molecule having a nucleotide sequence that encodes a tumor-associated antigenic target polypeptide or 
fragment thereof (a M TAT" polypeptide). 

In certain aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least 
about 80% nucleic acid sequence identity, alternatively at least about 81%, 82%, 83%, 84%, 85%, 86%, 87%, 
88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% nucleic acid sequence identity, 

10 to (a) a DNA molecule encoding a full-length TAT polypeptide having an amino acid sequence as disclosed 
herein, a TAT polypeptide amino acid sequence lacking the signal peptide as disclosed herein, an extracellular 
domain of a transmembrane TAT polypeptide, with or without the signal peptide, as disclosed herein or any 
other specifically defined fragment of a full-length TAT polypeptide amino acid sequence as disclosed herein, 
or (b) the complement of the DNA molecule of (a). 

15 In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least 

about 80% nucleic acid sequence identity, alternatively at least about 81%, 82%, 83%, 84%, 85%, 86%, 87%, 
88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% nucleic acid sequence identity, 
to (a) a DNA molecule comprising the coding sequence of a full-length TAT polypeptide cDNA as disclosed 
herein, the coding sequence of a TAT polypeptide lacking the signal peptide as disclosed herein, the coding 

20 sequence of an extracellular domain of a transmembrane TAT polypeptide, with or without the signal peptide, 
as disclosed herein or the coding sequence of any other specifically defined fragment of the Ml-length TAT 
polypeptide amino acid sequence as disclosed herein, or (b) the complement of the DNA molecule of (a). 

In further aspects, the invention concerns an isolated nucleic acid molecule comprising a nucleotide 
sequence having at least about 80% nucleic acid sequence identity, alternatively at least about 81%, 82%, 83%, 

25 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% 
nucleic acid sequence identity, to (a) a DNA molecule that encodes the same mature polypeptide encoded by 
the full-length coding region of any of the human protein cDNAs deposited with the ATCC as disclosed herein, 
or (b) the complement of the DNA molecule of (a). 

Another aspect of the invention provides an isolated nucleic acid molecule comprising a nucleotide 

30 sequence encoding a TAT polypeptide which is either transmembrane domain-deleted or transmembrane domain- 
inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane domain(s) 
of such polypeptide(s) are disclosed herein. Therefore, soluble extracellular domains of the herein described 
TAT polypeptides are contemplated. 

In other aspects, the present invention is directed to isolated nucleic acid molecules which hybridize 

35 to (a) a nucleotide sequence encoding a TAT polypeptide having a full-length amino acid sequence as disclosed 
herein, a TAT polypeptide amino acid sequence lacking the signal peptide as disclosed herein, an extracellular 
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domain of a transmembrane TAT polypeptide, with or without the signal peptide, as disclosed herein or any 
other specifically defined fragment of a full-length TAT polypeptide amino acid sequence as disclosed herein, 
or (b) the complement of the nucleotide sequence of (a). In this regard, an embodiment of the present invention 
is directed to fragments of a full-length TAT polypeptide coding sequence, or the complement thereof, as 
disclosed herein, that may find use as, for example, hybridization probes useful as, for example, diagnostic 
probes, antisense oligonucleotide probes, or for encoding fragments of a full-length TAT polypeptide mat may 
optionally encode a polypeptide comprising a binding site for an anti-TAT polypeptide antibody, a TAT binding 
oligopeptide or other small organic molecule that binds to a TAT polypeptide. Such nucleic acid fragments are 
usually at least about 5 nucleotides in length, alternatively at least about 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 
17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 
105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 210, 
220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 
430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 
640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830, 840, 
850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 980, 990, or 1000 nucleotides in length, 
15 wherein in this context the term "about" means the referenced nucleotide sequence length plus or minus 10% 
of that referenced length. It is noted that novel fragments of a TAT polypeptide-encoding nucleotide sequence 
may be determined in a routine manner by aligning the TAT polypeptide-encoding nucleotide sequence with 
other known nucleotide sequences using any of a number of well known sequence alignment programs and 
detennining which TAT polypeptide-encoding nucleotide sequence fragment(s) are novel. All of such novel 
20 fragments of TAT polypeptide-encoding nucleotide sequences are contemplated herein. Also contemplated are 
the TAT polypeptide fragments encoded by these nucleotide molecule fragments, preferably those TAT 
polypeptide fragments that comprise a binding site for an anti-TAT antibody, a TAT binding oligopeptide or 
other small organic molecule that binds to a TAT polypeptide. 

In another embodiment, the invention provides isolated TAT polypeptides encoded by any of the 
25 isolated nucleic acid sequences hereinabove identified. 

In a certain aspect, the invention concerns an isolated TAT polypeptide, comprising an amino acid 
sequence having at least about 80% amino acid sequence identity, alternatively at least about 81%, 82%, 83%, 
84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% amino 
acid sequence identity, to a TAT polypeptide having a full-length amino acid sequence as disclosed herein, a 
30 TAT polypeptide ainino acid sequence lacking the signal peptide as disclosed herein, an extracellular domain 
of a transmembrane TAT polypeptide protein, with or without the signal peptide, as disclosed herein, an amino 
acid sequence encoded by any of the nucleic acid sequences disclosed herein or any other specifically defined 
fragment of a full-length TAT polypeptide amino acid sequence as disclosed herein. 

In a further aspect, the invention concerns an isolated TAT polypeptide comprising an amino acid 
35 sequence having at least about 80% amino acid sequence identity, alternatively at least about 81%, 82%, 83%, 
84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% amino acid 
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sequence identity, to an amino acid sequence encoded by any of the human protein cDNAs deposited with the 
ATCC as disclosed herein. 

In a specific aspect, the invention provides an isolated TAT polypeptide without the N-terminal signal 
sequence and/or without the initiating methionine and is encoded by a nucleotide sequence that encodes such 
an amino acid sequence as hereinbefore described. Processes for producing the same are also herein described, 
5 wherein those processes comprise culturing a host cell comprising a vector which comprises the appropriate 
encoding nucleic acid molecule under conditions suitable for expression of the TAT polypeptide and recovering 
the TAT polypeptide from the cell culture. 

Another aspect of the invention provides an isolated TAT polypeptide which is either transmembrane 
domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein 
10 described, wherein those processes comprise culturing a host cell comprising a vector which comprises the 
appropriate encoding nucleic acid molecule under conditions suitable for expression of the TAT polypeptide and 
recovering the TAT polypeptide from the cell culture. 

In other embodiments of the present invention, the invention provides vectors comprising DNA 
encoding any of the herein described polypeptides. Host cells comprising any such vector are also provided. 
15 By way of example, the host cells may be CHO cells, E. coli cells, or yeast cells. A process for producing any 
of the herein described polypeptides is further provided and comprises culturing host cells under conditions 
suitable for expression of the desired polypeptide and recovering the desired polypeptide from the cell culture. 

In other embodiments, the invention provides isolated chimeric polypeptides comprising any of the 
herein described TAT polypeptides fused to a heterologous (non-TAT) polypeptide. Example of such chimeric 
20 molecules comprise any of the herein described TAT polypeptides fused to a heterologous polypeptide such as, 
for example, an epitope tag sequence or a Fc region of an immunoglobulin. 

In another embodiment, the invention provides an antibody which binds, preferably specifically, to 
any of the above or below described polypeptides. Optionally, the antibody is a monoclonal antibody, antibody 
fragment, chimeric antibody, humanized antibody, single-chain antibody or antibody that competitively inhibits 
25 the binding of an anti-TAT polypeptide antibody to its respective antigenic epitope. Antibodies of the present 
invention may optionally be conjugated to a growth inhibitory agent or cytotoxic agent such as a toxin, 
including, for example, a maytansinoid or calicheamicin, an antibiotic, a radioactive isotope, a nucleolytic 
enzyme, or the like. The antibodies of the present invention may optionally be produced in CHO cells or 
bacterial cells and preferably induce death of a cell to which they bind. For diagnostic purposes, the antibodies 
30 of the present invention may be detectably labeled, attached to a solid support, or the like. 

In other embodiments of the present invention, the invention provides vectors comprising DNA 
encoding any of the herein described antibodies. Host cell comprising any such vector are also provided. By 
way of example, the host cells may be CHO cells, E. coli cells, or yeast cells. A process for producing any 
of the herein described antibodies is further provided and comprises culturing host cells under conditions suitable 
35 for expression of the desired antibody and recovering the desired antibody from the cell culture. 

In another embodiment, the invention provides oligopeptides ("TAT binding oligopeptides") which 
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bind, preferably specifically, to any of the above or below described TAT polypeptides. Optionally, the TAT 
binding oligopeptides of the present invention may be conjugated to a growth inhibitory agent or cytotoxic agent 
such as a toxin, including, for example, a maytansinoid or calicheamicin, an antibiotic, a radioactive isotope, 
a nucleolytic enzyme, or the like. The TAT binding oligopeptides of the present invention may optionally be 
produced in CHO cells or bacterial cells and preferably induce death of a cell to which they bind. For 
5 diagnostic purposes, the TAT binding oligopeptides of the present invention may be detectably labeled, attached 
to a solid support, or the like. 

In other embodiments of the present invention, the invention provides vectors comprising DNA 
encoding any of the herein described TAT binding oligopeptides. Host cell comprising any such vector are also 
provided. By way of example, the host cells may be CHO cells, E. coli cells, or yeast cells. A process for 

1 0 producing any of the herein described TAT binding oligopeptides is further provided and comprises culturing 
host cells under conditions suitable for expression of the desired oligopeptide and recovering the desired 
oligopeptide from the cell culture. 

In another embodiment, the invention provides small organic molecules (TAT binding organic 
molecules") which bind, preferably specifically, to any of the above or below described TAT polypeptides. 

15 Optionally, the TAT binding organic molecules of the present invention may be conjugated to a growth 

inhibitory agent or cytotoxic agent such as a toxin, including, for example, a maytansinoid or calicheamicin, 
an antibiotic, a radioactive isotope, a nucleolytic enzyme, or the like. The TAT binding organic molecules of 
the present invention preferably induce death of a cell to which they bind. For diagnostic purposes, the TAT 
binding organic molecules of the present invention may be detectably labeled, attached to a solid support, or 

20 the like. 

In a still further embodiment, the invention concerns a composition of matter comprising a TAT 
polypeptide as described herein, a chimeric TAT polypeptide as described herein, an anti-TAT antibody as 
described herein, a TAT binding oligopeptide as described herein, or a TAT binding organic molecule as 
described herein, in combination with a carrier. Optionally, the carrier is a pharmaceutically acceptable carrier. 

25 In yet another embodiment, the invention concerns an article of manufacture comprising a container 

and a composition of matter contained within the container, wherein the composition of matter may comprise 
a TAT polypeptide as described herein, a chimeric TAT polypeptide as described herein, an anti-TAT antibody 
as described herein, a TAT binding oligopeptide as described herein, or a TAT binding organic molecule as 
described herein. The article may further optionally comprise a label affixed to the container, or a package 

30 insert included with the container, that refers to the use of the composition of matter for the therapeutic 
treatment or diagnostic detection of a tumor. 

Another embodiment of the present invention is directed to the use of a TAT polypeptide as described 
herein, a chimeric TAT polypeptide as described herein, an anti-TAT polypeptide antibody as described herein, 
a TAT binding oligopeptide as described herein, or a TAT binding organic molecule as described herein, for 

35 the preparation of a medicament useful in the treatment of a condition which is responsive to the TAT 

polypeptide, chimeric TAT polypeptide, anti-TAT polypeptide antibody, TAT binding oligopeptide, or TAT 
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binding organic molecule. 

B. Additiona l F.mhnHimpntc 

Another embodiment of the present invention is directed to a method for inhibiting the growth of a cell 
that expresses a TAT polypeptide, wherein the method comprises contacting the cell with an antibody, an 
oligopeptide or a small organic molecule that binds to the TAT polypeptide, and wherein (he binding of the 
5 antibody, oligopeptide or organic molecule to the TAT polypeptide causes inhibition of the growth of the cell 
expressing the TAT polypeptide. In preferred embodiments, the cell is a cancer cell and binding of the 
antibody, oligopeptide or organic molecule to the TAT polypeptide causes death of the cell expressing the TAT 
polypeptide. Optionally, the antibody is a monoclonal antibody, antibody fragment, chimeric antibody, 
humanized antibody, or single-chain antibody. Antibodies, TAT binding oligopeptides and TAT binding organic 

10 molecules employed in the methods of the present invention may optionally be conjugated to a growth inhibitory 
agent or cytotoxic agent such as a toxin, including, for example, a maytansinoid or calicheamicin, an antibiotic, 
a radioactive isotope, a nucleolytic enzyme, or the like. The antibodies and TAT binding oligopeptides 
employed in the methods of the present invention may optionally be produced in CHO cells or bacterial cells. 
Yet another embodiment of the present invention is directed to a method of therapeutically treating a 

15 mammal having a cancerous tumor comprising cells that express a TAT polypeptide, wherein the method 
comprises administering to the mammal a therapeutically effective amount of an antibody, an oligopeptide or 
a small organic molecule that binds to the TAT polypeptide, thereby resulting in the effective therapeutic 
treatment of the tumor. Optionally, the antibody is a monoclonal antibody, antibody fragment, chimeric 
antibody, humanized antibody, or single-chain antibody. Antibodies, TAT binding oligopeptides and TAT 

20 binding organic molecules employed in the methods of the present invention may optionally be conjugated to 
a growth inhibitory agent or cytotoxic agent such as a toxin, including, for example, a maytansinoid or 
calicheamicin, an antibiotic, a radioactive isotope, a nucleolytic enzyme, or the like. The antibodies and 
oligopeptides employed in the methods of the present invention may optionally be produced in CHO cells or 
bacterial cells. 

25 Yet m ° am embodiment of the present invention is directed to a method of determining the presence 

of a TAT polypeptide in a sample suspected of containing the TAT polypeptide, wherein the method comprises 
exposing the sample to an antibody, oligopeptide or small organic molecule that binds to the TAT polypeptide 
and deteirmining binding of the antibody, oligopeptide or organic molecule to the TAT polypeptide in the sample, 
wherein the presence of such binding is indicative of the presence of the TAT polypeptide in the sample. 

30 Optionally, the sample may contain cells (which may be cancer cells) suspected of expressing the TAT 

polypeptide. The antibody, TAT binding oligopeptide or TAT binding organic molecule employed in the 
method may optionally be detectably labeled, attached to a solid support, or the like. 

A further embodiment of the present invention is directed to a method of diagnosing the presence of 
a tumor in a mammal, wherein the method comprises detecting the level of expression of a gene encoding a 

35 TAT polypeptide (a) in a test sample of tissue cells obtained from said mammal, and (b) in a control sample of 
known normal non-cancerous cells of the same tissue origin or type, wherein a higher level of expression of the 
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TAT polypeptide in the test sample, as compared to the control sample, is indicative of the presence of tumor 
in the mammal from which the test sample was obtained. 

Another embodiment of the present invention is directed to a method of diagnosing the presence of a 
tumor in a mammal, wherein the method comprises (a) contacting a test sample comprising tissue cells obtained 
from the mammal with an antibody, oligopeptide or small organic molecule that binds to a TAT polypeptide and 
(b) detecting the formation of a complex between the antibody, oligopeptide or small organic molecule and the 
TAT polypeptide in the test sample, wherein the formation of a complex is indicative of the presence of a tumor 
in the mammal. Optionally, the antibody, TAT binding oligopeptide or TAT binding organic molecule 
employed is detectably labeled, attached to a solid support, or the like, and/or the test sample of tissue cells is 
obtained from an individual suspected of having a cancerous tumor. 

Yet another embodiment of the present invention is directed to a method for treating or preventing a 
cell proliferative disorder associated with altered, preferably increased, expression or activity of a TAT 
polypeptide, the method comprising administering to a subject in need of such treatment an effective amount 
of an antagonist of a TAT polypeptide. Preferably, the cell proliferative disorder is cancer and the antagonist 
of the TAT polypeptide is an anti-TAT polypeptide antibody, TAT binding oligopeptide, TAT binding organic 
molecule or antisense oligonucleotide. Effective treatment or prevention of the cell proliferative disorder may 
be a result of direct killing or growth inhibition of cells that express a TAT polypeptide or by antagonizing the 
cell growth potentiating activity of a TAT polypeptide. 

Yet another embodiment of the present invention is directed to a method of binding an antibody, 
oligopeptide or small organic molecule to a cell that expresses a TAT polypeptide, wherein the method 
comprises contacting a cell that expresses a TAT polypeptide with said antibody, oligopeptide or small organic 
molecule under conditions which are suitable for bmding of the antibody, oligopeptide or small organic molecule 
to said TAT polypeptide and allowing binding therebetween. 

Other embodiments of the present invention are directed to the use of (a) a TAT polypeptide, (b) a 
nucleic acid encoding a TAT polypeptide or a vector or host cell comprising that nucleic acid, (c) an anti-TAT 
polypeptide antibody, (d) a TAT-binding oligopeptide, or (e) a TAT-binding small organic molecule in the 
preparation of a medicament useful for (i) the therapeutic treatment or diagnostic detection of a cancer or tumor, 
or (ii) the therapeutic treatment or prevention of a cell proliferative disorder. 

Another embodiment of the present invention is directed to a method for inhibiting the growth of a 
cancer cell, wherein the growth of said cancer cell is at least in part dependent upon the growth potentiating 
effect(s) of a TAT polypeptide (wherein the TAT polypeptide may be expressed either by the cancer cell itself 
or a cell that produces polypeptide(s) that have a growth potentiating effect 6n cancer cells) , wherein the method 
comprises contacting the TAT polypeptide with an antibody, an oligopeptide or a small organic molecule that 
binds to the TAT polypeptide, thereby antagonizing the growth-potentiating activity of the TAT polypeptide and, 
in turn, inhibiting the growth of the cancer cell. Preferably the growth of the cancer cell is completely inhibited! 
Even more preferably, binding of the antibody, oligopeptide or small organic molecule to the TAT polypeptide 
induces the death of the cancer cell. Optionally, the antibody is a monoclonal antibody, antibody fragment, 
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chimeric antibody, humanized antibody, or single-chain antibody. Antibodies, TAT binding oligopeptides and 
TAT binding organic molecules employed in the methods of the present invention may optionally be conjugated 
to a growth inhibitory agent or cytotoxic agent such as a toxin, including, for example, a maytansinoid or 
calicheamicin, an antibiotic, a radioactive isotope, a nucleolytic enzyme, or the like. The antibodies and TAT 
binding oligopeptides employed in the methods of the present invention may optionally be produced in CHO 
5 cells or bacterial cells. 

Yet another embodiment of the present invention is directed to a method of therapeutically treating a 
tumor in a mammal, wherein the growth of said tumor is at least in part dependent upon the growth potentiating 
effect(s) of a TAT polypeptide, wherein the method comprises administering to the mammal a therapeutically 
effective amount of an antibody, an oligopeptide or a small organic molecule that binds to the TAT polypeptide, 

10 thereby antagonizing the growth potentiating activity of said TAT polypeptide and resulting in the effective 
therapeutic treatment of the tumor. Optionally, the antibody is a monoclonal antibody, antibody fragment, 
chimeric antibody, humanized antibody, or single-chain antibody. Antibodies, TAT binding oligopeptides and 
TAT binding organic molecules employed in the methods of the present invention may optionally be conjugated 
to a growth inhibitory agent or cytotoxic agent such as a toxin, including, for example, a maytansinoid or 

15 calicheamicin, an antibiotic, a radioactive isotope, a nucleolytic enzyme, or the like. The antibodies and 

oligopeptides employed in the methods of the present invention may optionally be produced in CHO cells or 
bacterial cells. 

Yet further embodiments of the present invention will be evident to the skilled artisan upon a reading 
of the present specification. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 
In the list of figures for the present application, specific cDNA sequences which are upregulated in 
certain tumor tissues as compared to their normal tissue counterparts are individually identified with a 
designation beginning with the letters "DNA" followed by a specific numerical designation. A full or partial 
25 length protein sequence that is encoded by a cDNA sequence identified and shown herein is individually 
identified with a designation beginning with the letters "PRO" followed by a specific numerical designation. 
Figures showing encoded amino acid sequences immediately follow the figure showing the cDNA sequence 
encoding that specific amino acid sequence. If start and/or stop codons have been identified in a cDNA 
sequence shown in the attached figures, they are shown in bold and underlined font. 

30 
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Figure 40: DNA323738, XM .030920, gen.XM J030920 
Figure 41: DNA323739, NMj018948, gen.NM J018948 
Figure 42: DNA273712, NM .007262, gen.NM J007262 
Figure 43: PR061679 

Figure 44: DNA1 5 1 148, NM .00478 1 , gen.NM J00478 1 
Figure 45: PR012618 

Figure 46: DNA323740, XM .086 151, gen.XM J086151 
Figure 47: PRO80497 

Figure 48: DNA171408, NM .004401, gen.NM J004401 
Figure 49: PRO20136 

Figure 50: DNA323741, NM .003 132, gen.NM J003 132 
Figure 51: PRO80498 

Figure 52: DNA323742, XM .0865 86, gen.XM J0865 86 



Figure 53: PRO80499 

Figure 54: DNA323743, XM .086587, gen .XM J0865 87 

Figure 55: DNA323744, XM. 059230, gen.XM .059230 

Figure 56: PRO80501 

Figure 57A-B: DNA323745, XM .0487 80, 

gen.XM .048780 

Figure 58: DNA323746,XM .053183, gen.XM J053 183 
Figure 59: DNA323747, XM .165442, gen.XM .165442 
Figure 60: DNA323748,NM .03 3440, gen.NM J03 3440 
Figure 61: PR02269 

Figure 62: DNA323749, NM .024329, gen.NM J0243 29 
Figure 63: PRO80505 

Figure 64: DNA323750, XM .0 1 8205 , gen.XM JO 1 8205 
Figure 65: PRO80506 

Figure 66: DNA323751, XM.01 1650, gen.XM jOI 1650 
Figure 67: DNA323752,XM.017315,gen.XMj017315 
Figure 68A-B: DNA323753, XM.030470, 
genJCM.030470 

Figure 69: DNA323754, NM. 004930, gen.NM .004930 
Figure 70: PRO80510 

Figure 71: DNA323755, NM. 003689, gen.NM J003689 
Figure 72: PRO80511 

Figure 73: DNA323756,NM.016183,gen.NMj016183 
Figure 74: PRO80512 

Figure 75: DNA323757, XM .015234, gen.XM JO 15234 
Figure 76A-B: DNA323758, XM.027916, 
gen.XM.027916 

Figure 77: DNA323759, XM .033683, gen.XM J033683 
Figure 78: DNA323760, XM .00 1826, gen.XM .00 1826 
Figure 79: DNA323761, XM.033654, gen.XM .033654 
Figure 80: PRO80517 

Figure 81: DNA323762,NM .00 1791, gen.NM JOO 1791 
Figure 82: PR026194 

Figure 83: DNA323763, NM .005826, gen.NM _005 826 
Figure 84: PRO608 15 

Figure 85: DNA323764, XM .086357, gen.XM J086357 
Figure 86: PRO80518 

Figure 87: DNA323765, NM .000975, gen.NM .000975 
Figure 88: PRO80519 

Figure 89: DNA323766, NM .007260, gen.NM J0O726O 
Figure 90: PRO61250 

Figure 91: DNA323767,NM .01 7761, gen.NM .01 7761 
Figure 92: PRO80520 

Figure 93: DNA323768, NM.006625, gen.NM JD06625 
Figure 94: PR022196 

Figure 95: DNA323769, NM .0540 16, gen.NM J0540 16 
Figure 96: PRO80521 

Figure 97: DNA323770, XM .086375, gen.XM J086375 

Figure 98: DNA323771,XM .006290, gen.XM J006290 

Figure 99: DNA323772, NM .01 5484, gen.NM JO 15484 

Figure 100: PRO80524 

Figure 101 A-B: DNA323773,XM .001616, 

gen.XM.001616 

Figure 102: DNA323774.XM.058240, 
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gen.XM J058240 

Figure 103: DNA323775,XM.059117, 

gen.XM .0591 17 

Figure 104: PRO80527 

Figure 105: DNA226262, NM .005563, 

gen.NM .D05563 

Figure 106: PR036725 

Figure 107: DNA323776,NM .022778, 

gen.NMJ322778 

Figure 108: PRO80528 

Figure 109: DNA323777.XM.017846, 

gen.XM .017846 

Figure 110: DNA323778, NM.005517, 

gen.NM_005517 

Figure lll:PRO80530 

Figure 112A-C:DNA323779,XM.046918, 

gen.XM J046918 

Figure 113: DNA323780,XM .002114, 
gen.XMj002114 

Figure 114: DNA323781, XM .059066, 

gen.XM .059066 

Figure 115: PRO80533 

Figure 116: DNA323782, NM.018066, 

gen.NM .018066 

Figure 117: PRO80534 

Figure 118: DNA323783.NM .006600, 

gen.NMJ)06600 

Figure 119: PRO80535 

Figure 120: DNA323784,XM .059067, 

gen.XM.O59067 

Figure 121: PRO80536 

Figure 122: DNA323785, NM .032872, 

gen.NMj032872 

Figure 123: PRO80537 

Figure 124: DNA196349, NM.006990, 

gen.NM .006990 

Figure 125: PR024856 

Figure 126: DNA323788, XM.001640, 

gen.XM J001640 

Figure 127: DNA323789, NM.002946, 

gen.NM .002946 

Figure 128: PRO59099 

Figure 129: DNA323790, XM. 114044, 

gen.XM.l 14044 

Figure 130: DNA323791,XM .059088, 
gen.XM.059088 

Figure 131: DNA323792.NM.031459, 

gen.NM.031459 

Figure 132: PRO80542 

Figure 133: DNA323793, XM.010664, 

gen.XMJ)10664 

Figure 134: DNA323794,XM.001812, 
gen.XM .001812 

Figure 135: DNA323795, XM.001807, 
gen.XM.001807 

Figure 136: DNA323796, XM. 086444, 



gen JCM .086444 

Figure 137: DNA323797, NM .024640, 

gen.NM .024640 

Figure 138: PRO80547 

Figure 139A-B: DNA323798.XM.049310, 

gen.XM.049310 

Figure 140: DNA323799,XM.113374, 
gen.XM.l 13374 

Figure 141: DNA323800, XM.002105, 
gen.XM .002105 

Figure 142: DNA323801, NM .014571, 

gen.NM.014571 

Figure 143: PRO80550 

Figure 144: DNA323802, XM.165438, 

gen.XM.165438 

Figure 145: DNA323 803 , XMj 029844, 
gen.XM .029844 

Figure 146: DNA188748,NM.006559, 

gen.NM.006559 

Figure 147: PRO22304 

Figure 148: DNA323804, NM.003757, 

gen.NM .003757 

Figure 149: PRO80553 

Figure 150: DNA323805.NM .004964, 

gen.NM .004964 

Figure 151: PRO80554 

Figure 152: DNA323806, NM.023009, 

gen.NM .023009 

Figure 153: PRO80555 

Figure 154: DNA323 807, XM. 030423, 

gen.XM .030423 

Figure 155A-B: DNA323 808, XM .036299, 

gen.XM.036299 

Figure 156: PRO80557 

Figure 157: DNA22721 3, NM. 003680, 

gen.NM .003680 

Figure 158: PR037676 

Figure 159: DNA323809, NM.006112, 

gen.NM.006112 

Figure 160: PRO80558 

Figure 161: DNA323810, XM.018136, 

gen.XM .018136 

Figure 162: PRO80559 

Figure 163: DNA323811, XM.117184, 

gen.XM.117184 

Figure 164: PRO80560 

Figure 165: DNA323812,NM.017825, 

gen.NM_017825 

Figure 166: PRO80561 

Figure 167: DNA1 893 15, NM. 014408, 

gen.NM .014408 

Figure 168: PR022262 

Figure 169A-B: DNA323813.XM.029031, 

gen.XM .029031 

Figure 170: PRO80562 

Figure 171: DNA323814,XM.059171, 
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Figure 172: PRO80563 

Figure 173: DNA83085, NM .000760, gen. NM J000760 
Figure 174: PR02583 
Figure 175: DNA323815, XM ,165984, 
gen.XM.165984 

Figure 176: DNA323816,XM_029842, 
gen.XM .029842 
Figure 177: PR02851 
Figure 178: DNA323817,XM.086384, 
gen.XMj086384 
Figure 179: PRO80565 
Figure 180A-C: DNA274487,NM .014747, 
gen.NM .014747 
Figure 181: PR062389 
Figure 182: DNA323818,XM_010712, 
gen.XM .010712 

Figure 183: DNA323819, NM.024664, 
gen.NM .024664 
Figure 184: PRO80567 
Figure 185: DNA323820.XM.059214, 
gen.XM .059214 
Figure 186: PRO80568 
Figure 187: DNA323 82 1,XM. 046349, 
gen.XM .046349 

Figure 188: DNA103253,NM_006516, 
gen.NMj006516 
Figure 189: PR04583 
Figure 190: DNA323822, XM.086543, 
gen.XMJ)86543 
Figure 191: PRO80570 
Figure 192: DNA274745,NM .006824, 
gen.NM .006824 
Figure 193: PR062518 
Figure 194: DNA27 3060, NM .00 1255, 
gen.NM.0O1255 . 
Figure 195: PR061 125 
Figure 196: DNA323823,NM .030587, 
gen.NMJ)30587 
Figure 197: PRO80571 
Figure 198: DNA323 824, XM .097649, 
gen.XM .097649 

Figure 199: DNA256503,NM .003780, 
gen.NM .003780 
Figure 200: PR051539 
Figure 201: DNA323825, XM.046450, 
gen.XM .046450 

Figure 202A-B: DNA272024, NM .014663, 
gen.NM .014663 
Figure 203: PRO60298 
Figure 204: DNA323826, XM.046565, 
gen.XM .046565 
Figure 205: PRO80574 
Figure 206: DNA323827, NM .024602, 
gen.NMj024602 
Figure 207: PRO80575 
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Figure 208: DNA323828.XM.046557, 

gen.XM.046557 

Figure 209: PRO80576 

Figure 210: DNA323829, NM. 001012, 

gen.NM .001012 

Figure 211: PRO10760 

Figure 212: DNA323 83 0,XM .046551, 

gen.XM .046551 

Figure 213A-B: DNA323831, XM .027983, 
gen.XM .027983 

Figure 214: DNA323832.XMJ086324, 

gen.XM .086324 

Figure 215: PRO80579 

Figure 216: DNA323833.XM.032391, 

gen.XM .032391 

Figure 217: PRO80580 

Figure 218: DNA103214.NM.006066, 

gen.NM .006066 

Figure 219: PR04544 

Figure 220: DNA304686, NM. 002574, 

gen.NM .002574 

Figure 221: PR071112 

Figure 222: DNA323834, NM .032756, 

gen.NM .032756 

Figure 223: PRO80581 

Figure 224: DNA323835, XM.059133, 

gen.XM.059133 

Figure 225: PRO80582 

Figure 226: DNA323836, XM .0273 13, 

gen.XM.027313 

Figure 227: PRO80583 

Figure 228: DNA323837, XM .054868, 

gen.XM .054868 

Figure 229: DNA323838, NM .001262, 

gen.NM.001262 

Figure 230: PR059546 

Figure 231: DNA323839,XM.086391, 

gen.XM .086391 

Figure 232: PRO80584 

Figure 233: DNA323840, XM.l 14798, 

gen.XM .114798 

Figure 234: PRO80585 

Figure 235: DNA272748, NM .002979, 

gen.NM .002979 

Figure 236: PRO60860 

Figure 237: DNA323841,XM.038911, 

gen.XM .038911 

Figure 238: PRO80586 

Figure 239: DNA323842, NM.018070, 

gen.NM_018070 

Figure 240: PRO80587 

Figure 241: DNA323843, NM.024603, 

gen.NM .024603 

Figure 242: PRO80588 

Figure 243: DNA323844.XM.086389, 

gen.XM .086389 
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Figure 244: DNA323845,XM.038852, 
gen.XMj038852 

Figure 245: DNA323846, NM .032864, 

gen.NM .032864*^ 

Figure 246: PRO80591 

Figure 247: DNA323847, NM.024586, 

gen.NM.024586 

Figure 248: PRO80592 

Figure 249A-B: DNA323848, XM .097565, 

gen.XMj097565 

Figure 250: DNA323849, XM .00 1472, 
gen.XMJ)01472 

Figure 251A-C: DNA323850, XM.055481, 

gen.XM J055481 

Figure 252: PRO80593 

Figure 253: DNA323851, XM.010615, 

gen.XM .010615 

Figure 254A-B: DNA323852, XM_089138, 

gen.XM_089138 

Figure 255: PRO80595 

Figure 256A-B: DNA323853, XM .059180, 

gen.XM .059180 

Figure 257: DNA323854, XM.015717, 

gen.XMj015717 

Figure 258: PRO80597 

Figure 259: DNA323855,XM .114125, 

gen.XM.l 14125 

Figure 260: DNA323 856, NM .015640, 

gen.NM.015640 

Figure 261: PRO80599 

Figure 262: DNA323857, NM.017768, 

gen.NM J017768 

Figure 263: PRO80600 

Figure 264: DNA323858, XM.165977, 

gen.XM.165977 

Figure 265: DNA323859, XM.086343, 

gen.XM .086343 

Figure 266: PRO80602 

Figure 267: DNA269708, NM .007034, 

gen.NMJ007034 

Figure 268: PR058118 

Figure 269: DNA323860,NM.001554, 

gen.NM_001554 

Figure 270: PRO80603 

Figure 271: DNA226260, NM.006769, 

gen.NM .006769 

Figure 272: PR036723 

Figure 273: DNA323861,NM.004261, 

gen.NM .004261 

Figure 274: PRO80604 

Figure 275: DNA323862,XM_165983, 

gen.XM.165983 

Figure 276: DNA323863.XM.016164, 
gen.XM JH6164 

Figure 277: DNA323864,XM.086164, 
gen.XM .086164 



Figure 278: PRO80607 

Figure 279: DNA323 865, XM .086165, 

gen.XM.086165 

Figure 280: DNA323866, XMi)86167,~" 
gen.XM.086167 

Figure 281: DNA323867, XM.086166, 
gen.XM .086166 

Figure 282: DNA323868, XM.086138, 

gen.XM_086138 \ 

Figure 283: PRO8061 1 

Figure 284: DNA323 869, NMj 000969, 

gen.NM .000969 

Figure 285: PRO80612 

Figure 286: DNA323870, XMJ088863, 

gen.XM.088863 

Figure 287: PRO80613 

Figure 288: DNA27 1003, NM .003729, 

gen.NM.003729 

Figure 289: PR059332 

Figure 290: DNA323871, XM.165981, 

gen.XM_165981 

Figure 291: PRO80614 

Figure 292: DNA275139, NM.013296, 

gen.NM .013296 

Figure 293: PR062849 

Figure 294: DNA323872, XM.058702, 

gen.XM.058702 

Figure 295: DNA323873, XM.054978, 
gen.XM .054978 

Figure 296: DNA323874, NM J032636, 

gen.NM .032636 

Figure 297: PRO806 17 

Figure 298: DNA323875, NM.006513, 

gen.NM.006513 

Figure 299: PRO80618 

Figure 300: DNA323876, NM.006621, 

gen.NM.006621 

Figure 301: PRO80619 

Figure 302A-B: DNA323877, NM J007158, 

gen.NM .007 158 

Figure 303: PRO80620 

Figure 304: DNA323878, XM.086132, 

gen.XM.086132 

Figure 305: PRO80621 

Figure 306: DNA323879, NM.004000, 

gen.NM .004000 

Figure 307: PRO80622 

Figure 308: DNA323880,NMJ)01688, 

gen.NM .001688 

Figure 309: PRO80623 

Figure 310: DNA323881, NM JO 19099, 

gen.NM.019099 

Figure 311: PRO80624 

Figure 3 12A-B: DNA323882, NM.000701, 

gen.NM_000701 

Figure 313: PRO80625 
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Figure 314A-B: DNA323883, XM.018332, 
gen.XM .018332 

Figure 315A-B: DNA323884, XM.040709, 

gen.XM -040709 

Figure 316: PRO80627 

Figure 317: DNA323885, XM .086518, 

gen.XM-086518 

Figure 318A-D: DNA323886, XM .034671, 
gen.XM.034671 

Figure 319: DNA323887, XM_034662, 

gen.XM.034662 

Figure 320: PRO80630 

Figure 321: DNA323888, XM .039721, 

gen.XMj039721 

Figure 322: PRO80631 

Figure 323A-B: DNA323889, XM. 086397, 

gen.XM.086397 

Figure 324A-B: DNA323890, XM.086515, 

gen.XM J0865 15 

Figure 325: PRO80633 

Figure 326: DNA323891, XM.016480, 

gen.XM .016480 

Figure 327: DNA323892,XM.165975, 
gen.XM-165975 

Figure 328: DNA323893, NM .016361, 

gen.NM .016361 

Figure 329: PR0231 

Figure 330: DNA323894, XM .059210, 

gen.XMJ)59210 

Figure 331: DNA323895, XM .086296, 
gen.XM.086296 

Figure 332: DNA323896, NM .030920, 

gen.NM .030920 

Figure 333: PRO80638 

Figure 334: DNA323897, NM.016022, 

gen.NM .016022 

Figure 335: PRO80639 

Figure 336: DNA323898, NM.031901, 

gen.NM.031901 

Figure 337: PRO80640 

Figure 338A-B: DNA323899, XM .088788, 

gen.XMj088788 

Figure 339: PRO80641 

Figure 340: DNA274759, NM .005620, 

gen.NM .005620 

Figure 341: PR062529 

Figure 342: DNA323900,XM.001468, 

gen.XM -001468 

Figure 343: PR049642 

Figure 344: DNA323901, NM.006862, 

gen.NM.006862 

Figure 345: PRO80642 

Figure 346: DNA227529.NM .002796, 

gen.NM -002796 

Figure 347: PR037992 

Figure 348: DNA323902, NM .0028 10, 



gen.NM.002810 
Figure 349: PR061638 
Figure 350: DNA290284, NM .005997, 
gen.NM .005997 
Figure 351:PRO70433 
, Figure 352: DNA323903, XM .097639, 
gen.XM .097639 

Figure 353: DNA323904,XM.041879, 
gen.XM .041879 

Figure 354: DNA323905, XM.041884, 

gen.XM .041884 

Figure 355: PRO80644 

Figure 356: DNA225809, NM -000396, 

gen.NM .000396 

Figure 357: PR036272 

Figure 358: DNA323906,NM.025150, 

gen.NM .025150 

Figure 359: PRO80645 

Figure 360: DNA323907, XM .1 14098, 

gen.XM_l 14098 

Figure 361: DNA323908, XM.113369, 

gen.XM .113369 

Figure 362: PRO80646 

Figure 363: DNA323909, XM.099467, 

gen.XM .099467 

Figure 364: DNA323910,NM -002965, 

gen.NM .002965 

Figure 365: PRO80648 

Figure 366: DNA323911, XM.086400, 

gen.XM_086400 

Figure 367: DNA210134, NM.014624, 

gen.NM .014624 

Figure 368: PR033679 

Figure 369: DNA304666, NM.002961, 

gen.NM -002961 

Figure 370: PRO71093 

Figure 371: DNA304720, NM.019554, 

gen.NM.019554 

Figure 372: PR071146 

Figure 373: DNA323912, XM .165976, 

gen.XM .165976 

Figure 374: DNA227577, NM. 006271, 

gen.NM -006271 

Figure 375: PRO38040 

Figure 376: DNA3239 1 3, XM .114097, 

gen.XM .114097 

Figure 377: DNA3239 14, XM. 040009, 

gen.XM .040009 

Figure 378: PRO80651 

Figure 379: DNA323915, NM .024330, 

gen.NM.024330 

Figure 380: PRO703 

Figure 381: DNA3239 1 6, NM .012437, 

gen.NM.012437 

Figure 382: PRO80652 

Figure 383: DNA3239 1 7, XM -086271, 
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gen.XMj086271 

Figure 384: DNA323918.XM.1 14055, 

gen.XM.l 14055 

Figure 385: PR037535 

Figure 386: DNA323919, XM.l 13360, 

gen.XM.l 13360 

Figure 387: PRO80654 

Figure 388: DNA323920, XM.086564, 

gen.XM .086564 

Figure 389: DNA323921, NM .005973, 

gen.NM.005973 

Figure 390: PRO80656 

Figure 391: DNA323922, XM .044077, 

gen.XM .044077 

Figure 392: DNA323923, NM_001878, 

gen.NM .001878 

Figure 393: PRO80657 

Figure 394: DNA323924,NM -021948, 

gen.NM .021948 

Figure 395: PRO6018 

Figure 396: DNA273088, NM.006365, 

gen.NM J006365 

Figure 397: PR061 146 

Figure 398: DNA323925,XM.044127, 

gen.XM .044127 

Figure 399: PRO80658 

Figure 400: DNA323926, XM.053245, 

gen.XM .053245 

Figure 401: PRO80659 

Figure 402: DNA257916, NM .032323, 

gen.NM .032323 

Figure 403: PR052449 

Figure 404: DNA323927, NM. 005572, 

gen.NM .005572 

Figure 405: PRO80660 

Figure 406: DNA323928, XM.044166, 

gen.XM .044166 

Figure 407: PRO80661 

Figure 408: DNA323929, XM.044128, 

gen.XM.044128 

Figure 409: DNA226125, NM .003145, 

gen.NM .003145 

Figure 410: PR036588 

Figure 411A-B: DNA323930, XM.044172, 

gen.XM .044172 

Figure 412: DNA323931, NM .032292, 

gen.NM.032292 

Figure 413: PRO80664 

Figure 414: DNA323932, NM.004632, 

gen.NM .004632 

Figure 415: PRO80665 

Figure 416: DNA323933, XM.044075, 

gen.XM .044075 

Figure 417: PRO80666 

Figure 418: DNA323934, NM.018253, 

gen.NM .01 8253 



Figure 419: PRO80667 

Figure 420: DNA323935, NMJ018116, 

gen.NM.018116 

Figure 421: PRO80668 

Figure 422: DNA323936, NM J002004, 

gen.NM .002004 

Figure 423: PRO80669 

Figure 424: DNA323937, NM .005698, 

gen.NM .005698 

Figure 425: PRO80670 

Figure 426: DNA323938, NM J052837, 

gen.NM .052837 

Figure 427: PRO80671 

Figure 428: DNA194600, NM J006589, 

gen.NM .006589 

Figure 429: PR023942 

Figure 430: DNA323939,XM .086567, 

gen.XM .086567 

Figure 431: PRO80672 

Figure 432: DNA323940,XM .086552, 

gen.XM_086552 

Figure 433: DNA323941, XM. 036744, 
gen.XM .036744 

Figure 434: DNA323942, NM .130898, 

gen.NM_130898 

Figure 435 : PRO80675 

Figure 436: DNA226793, NM .006694, 

gen.NM .006694 

Figure 437: PR037256 

Figure 438: DNA294794, NM .002870, 

gen.NM.002870 

Figure 439: PRO70754 

Figure 440: DNA323943,NM .001030, 

gen.NM .001030 

Figure 441: PRO80676 

Figure 442: DNA323944, XM .036829, 

gen.XM.036829 

Figure 443: PRO80677 

Figure 444: DNA3 23945, NM .015449, 

gen.NM .01 5449 

Figure 445: PRO80678 ' 

Figure 446: DNA323946, NM. 014847, 

gen.NM .014847 

Figure 447: PRO80679 

Figure 448: DNA323947, XM. 036934, 

gen.XM.036934 

Figure 449: PRO80680 

Figure 450A-B: DNA323948, XM.036845, 

gen.XM .036845 

Figure 451: DNA323949,XM .010636, 
gen.XM_010636 

Figure 452: DNA323950, NM .006556, 

gen.NM .006556 

Figure 453: PR062574 

Figure 454: DNA323 95 l,XMj 034082, 

gen.XM_034082 
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Figure 455: DNA323952, NM .025207, 

gen.NM .025207 

Figure 456: PRO80684 

Figure 457: DNA 1 03436, NM. 0038 15, 

gen.NM .0038 15 

Figure 458: PR04763 

Figure 459: DNA323953,NM .003516, 

gen.NM .003516 

Figure 460: PRO80685 

Figure 461: DNA323954,NM.005850, 

gen.NMj005850 % 

Figure 462: PR059725 

Figure 463A-B: DNA323955, NM.014849, 

gen.NMj014849 

Figure 464: PRO80686 

Figure 465: DNA323956, XM.059094, 

gen.XM .059094 

Figure 466: DNA323957, XM. 058247, 

gen.XM .058247 

Figure 467: PRO80688 

Figure 468: DNA32395 8, NM_ 003779, 

gen.NM .003779 

Figure 469: PRO80689 

Figure 470: DNA323959, NM .004550, 

gen.NM .004550 

Figure 471: PR058974 

Figure 472: DNA323960, XM .085581, 

gen.XM .085581 

Figure 473: DNA323961, XM.l 13379, 
gen.XM .113379 

Figure 474: DNA2266 1 9, NM. 003564, 

gen.NM .003564 

Figure 475: PRO37082 

Figure 476A-B: DNA323962, XM .049680, 

gen.XM .049680 
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gen.NM .002084 

Figure 1705: PR081281 

Figure 1706: DNA324640, NM .01 8047, 

gen.NM.018047 

Figure 1707: PR081282 

Figure 1708: DNA324641.NM.005617, 
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Figure 1709: PRO10849 
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gen.XM .003789 

Figure 1713: DNA324645, XM .087652, 
gen.XM .087652 

Figure 1714: DNA324646, XM .068853, 
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Figure 1715: PR08 1286 

Figure 1716: DNA324647, XM.116465, 
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Figure 1717: PR081287 

Figure 1718: DNA3 02020, NM. 005573, 
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Figure 1719: PRO70993 
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gen.NM_014773 

Figure 1722: PR059913 
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Figure 1725: PR081289 
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Figure 1729: PR081291 
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gen.NM .003735 

Figure 1731: PR081292 
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gen.NM .003736 

Figure 1733: PR012416 

Figure 1734A-B: DNA324654, NM.018912, 
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Figure 1735: PRO36058 

Figure 1736A-B: DNA324655, NM.018913, 
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Figure 1738A-B: DNA324656, NM.018914, 
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Figure 1740A-B: DNA324657, NM.018915, 
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Figure 1741: PRO36020 

Figure 1742A-B: DNA324658, NM.018916, 

gen.NM .018916 

Figure 1743: PR081295 
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gen.NM .018917 
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Figure 1746A-B: DNA324660,NM.018918, 
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Figure 1747: PR081297 
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Figure 1749: PR081298 
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Figure 1751: PR081299 

Figure 1752A-B: DNA324663, NM.018921, 
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Figure 1773: PRO81309 
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Figure 1774: DNA324674, NM.032403, 

gen.NM _X)32403 

Figure 1775: PRO81310 

Figure 1776: DNA324675, NM .032402, 

gen.NM .032402 

Figure 1777: PR081311 

Figure 1778: DNA324676,XM.098387, 

gen.XM .098387 

Figure 1779: DNA324677, NM.002109, 

gen.NM .002109 

Figure 1780: PRO4908 

Figure 1781: DNA324678,XM.084180, 

gen.XM.084180 

Figure 1782: PR081313 
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gen.XM .039975 

Figure 1784: PR081314 

Figure 1785: DNA324680, NM .033551, 

gen.NM .033551 

Figure 1786: PR081315 

Figure 1787: DNA324681,NM .004821, 

gen.NM_004821 

Figure 1788: PR081316 

Figure 1789: DNA324682, XM.068395, 

gen.XM .068395 

Figure 1790: PR081317 
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gen.NM .004060 

Figure 1792: PR036881 

Figure 1793A-B: DNA324683, XMJ056963, 

gen.XM .056963 

Figure 1794: PR081318 

Figure 1795: DNA324684, NM.004219, 

gen.NM_004219 

Figure 1796: PR0813 19 

Figure 1797: DNA324685,XM .094243, 

gen.XM_094243 

Figure 1798A-B: DNA324686, XM.047964, 
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Figure 1799: DNA324687, XM.016345, 
gen.XM_016345 
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gen.NM_002887 
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gen.XM_166029 
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Figure 1805: DNA324691,XM.043340, 

gen.XM .043340 

Figure 1806: PR08 1325 

Figure 1807: DNA324692, XM.l 16340, 
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Figure 1808A-B: DNA324693, XM.043388, 
gen.XM.043388 
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gen.XMJL 16856 

Figure 1811: DNA324695,XM.003716, 
gen.XM_003716 

Figure 1812: DNA227320,NM_003714, 

gen.NM_003714 

Figure 1813: PR037783 

Figure 1814: DNA324696, NM .032361, 

gen.NM.032361 

Figure 1815: PRO81330 

Figure 1816: DNA324697,XM .087773, 

gen.XM .087773 

Figure 1817: DNA324698, XM.l 14457, 
gen.XM.l 14457 

Figure 1818: DNA324699, XM.l 65483, 
gen.XM_165483 

Figure 1819: DNA324700, XM.l 14453, 
gen.XM .114453 

Figure 1820: DNA3 24701, XM.l 65484, 
gen.XM .165484 

Figure 1821: DNA324702,XM .030771, 

gen.XM .030771 
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Figure 1823: DNA3 24703, XM. 030777, 
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Figure 1824: DNA324704, XM.030782, 
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gen.NM .000505 
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gen.NM_006816 
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Figure 1842: DNA3247 12, XM.l 66028, 
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Figure 1843: DNA324713.NM.015043, 

gen.NMj015043 

Figure 1844: PR081345 

Figure 1845: DNA324714,XM_1 13468, 

gen JCM.1 13468 

Figure 1846: DNA324715, NM .014275, 

gen.NM .014275 

Figure 1847: PR01927 

Figure 1848: DNA324716,NM.054013, 

genJNM .054013 

Figure 1849: PR081347 

Figure 1850: DNA270675,NM .005520, 

gen.NMJ)05520 

Figure 1851: PRO59040 
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Figure 1853: PR025849 
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Figure 1855: PRO58006 
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Figure 1857: DNA324719, XM.116511, 
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Figure 1858: DNA324720, XM.087823, 
gen.XM.087823 

Figure 1859A-C:DNA324721,XM_053955, 
gen.XM.053955 

Figure 1860: DNA324722, XM.113476 
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Figure 1861: DNA324723, XM. 11 65 14, 
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Figure 1862: DNA324724, XM.094741, 
gen.XM.094741 

Figure 1863: DNA324725,NM.025168, 

gen.NM .025 168 

Figure 1864: PR081354 

Figure 1865A-B: DNA324726, XM.165740, 

gen.XM_165740 

Figure 1866: DNA272171,NM_002388, 

gen.NMJ002388 

Figure 1867: PRO60438 

Figure 1868: DNA3 24727, XM .167 169, 

gen.XM.167169 

Figure 1869: PR081355 

Figure 1870: DNA324728, NM .014452, 

gen.NM .014452 

Figure 1871: PRO868 

Figure 1872: DNA324729,XM. 166349, 

gen.XM_166349 

Figure 1873: PR081356 

Figure 1874: DNA304680, NM .007355, 

gen.NM .007355 

Figure 1875: PR071 106 

Figure 1876: DNA324730,XM .165772, 

gen.XM .165772 



Figure 1877: DNA324731, XM.168123, 
gen.XM_168123 

Figure 1878: DNA324732, XM. 166457, 
gen.XM_166457 

Figure 1879: DNA324733,XM. 166469, 
gen.XM.166469 

Figure 1880: DNA324734,NM.018135, 

gen.NM.018135 

Figure 1881: PR08 1359 

Figure 1882A-B: DNA324735, XM.166340, 

gen.XM.166340 

Figure 1883: DNA324736, XM.087960, 
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Figure 1884: DNA324737.XM. 166362, 

gen.XM.166362 

Figure 1885: PR08 1362 

Figure 1886: DNA227204,NM.015388, 

gen.NM.015388 

Figure 1887: PR037667 

Figure 1888: DNA324738, XM.166425, 

gen.XM.166425 

Figure 1889: PR081363 

Figure 1890: DNA324739, NM.057161, 

gen.NM.057161 

Figure 1891: PR081364 
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gen.NM.006245 

Figure 1893: PR058984 
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gen.NM.006586 

Figure 1895: PR081365 

Figure 1896: DNA324741,XM. 166402, 

gen.XM_166402 

Figure 1897: PR081366 

Figure 1898: DNA324742, NM.001760, 

gen.NM_001760 

Figure 1899: PR081367 

Figure 1900: DNA287246,NM .004053, 

gen.NM.004053 

Figure 1901: PR069521 

Figure 1902: DNA324743,NM.017601, 
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Figure 1903; PR081368 
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Figure 1905: PR063253 

Figure 1906: DNA324744.NM .014341, 
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Figure 1908: DNA304460, NM .01 6059, 
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Figure 1911: PRO81370 
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Figure 1913: PR071142 

Figure 1914: DNA324746.XM. 166417, 

gen.XM .166417 

Figure 1915: PR081371 

Figure 1916A-B: DNA324747, NM_003137, 

gen.NM .003137 

Figure 1917: PR081372 

Figure 1918A-B: DNA324748, NM.004117, 
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Figure 1919: PR036841 

Figure 1920: DNA324749,XM .166419, 

gen.XM.166419 

Figure 1921: DNA324750, XM .165794, 
gen.XM .165794 
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gen.NM.007104 
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Figure 1924: DNA324752, NM.024294, 

gen.NM .024294 

Figure 1925: PR081375 

Figure 1926: DNA324753, NM.022758, 

gen.NM .022758 

Figure 1927: PRO50582 

Figure 1928: DNA324754, XM.168070, 

gen.XM .168070 

Figure 1929: DNA324755,NM_012391, 

gen.NM .012391 

Figure 1930: PR081377 

Figure 1931: DNA324756, XM .166459, 

gen.XM.166459 

Figure 1932: DNA324757, XM .166333, 

gen.XM.166333 

Figure 1933: PR081379 

Figure 1934: DNA324758, XM .058039, 

gen.XM.058039 

Figure 1935: PRO81380 

Figure 1936: DNA324759, XM.087990, 

gen.XM .087990 

Figure 1937: DNA324760, XM.165743, 
gen.XM.l 65743 

Figure 1938: DNA324761, XM.166360, 
gen.XM.166360 
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gen.XM .059801 
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Figure 1941: DNA324765, XM.016857, 
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Figure 1943: PRO37905 
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gen.NM .005452 

Figure 1945: PR081387 

Figure 1946: DNA304661,NM .022551, 



gen.NM .022551 
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Figure 1950: PR04884 

Figure 1951A-B: DNA324769, XM.165770, 
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Figure 1953: PRO69506 
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gen.XM_166480 
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Figure 1965: DNA324775,NM_021177, 

gen.NM.021177 

Figure 1966: PR081394 
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Figure 1970: PR069584 
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Figure 1982: PR081399 

Figure 1983: DNA324782,XM.165771, 

gen.XM_165771 

Figure 1984: DNA324783, NM.080598, 

gen.NM.080598 

Figure 1985: PR071 125 

Figure 1986: DNA304699, NM ,004640, 

gen.NM .004640 
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Figure 1989: PRO81400 
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Figure 1991: PRO81401 

Figure 1992: DNA324786, XM.166381, 
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Figure 1993: PRO81402 
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Figure 1995: DNA324788, XM.166401, 
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Figure 2003: PR01 112 
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Figure 2012: PRO81410 

Figure 2013: DNA324796, XM.165758, 

gen.XM.165758 

Figure 2014: PR081411 

Figure 2015: DNA324797, XM. 166406, 

gen.XM_l 66406 

Figure 2016: DNA324798, XM.165809, 
gen.XM_165809 

Figure 2017: DNA324799, NM.018950, 



gen.NM.018950 

Figure 2018: PR081414 

Figure 2019: DNA324800,XM.166392, 
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Figure 2020: PR081415 

Figure 2021: DNA324801,XM_166336, 
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Figure 2022: PR081416 

Figure 2023: DNA324802,XM.167128, 
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Figure 2025: DNA324803,XM.l 67161, 
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gen.NM .003472 

Figure 2035: PR012797 

Figure 2036A-B: DNA324807, XM.165728, 

gen.XM_165728 

Figure 2037: DNA324808, XM.165749, 

gen.XM.165749 
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Figure 2053: PRO81430 

Figure 2054A-B: DNA324818, XM .166042, 

gen.XM.166042 

Figure 2055: PR05 1389 

Figure 2056: DNA324819, XM.052721, 

gen.XMj052721 
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gen.XM.l 14497 
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Figure 2060: DNA324823, XM .094855, 
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Figure 2088: PR081449 
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gen.XM .087855 
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gen.XM .087853 
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gen.XM.165669 
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gen.XM .167037 

Figure 2097: PR081455 
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Figure 2105: PR059717 
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gen.XM .037056 
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Figure 6182: PR039127 

Figure 6183: DNA327041, XM .054098, 

gen.XMj054098 

Figure 6184: PR083341 

Figure 6185: DNA327042, NM .002668, 

gen.NM .002668 

Figure 6186: PR034584 

Figure 6187: DNA271580, NM .014008, 

gen.NM .014008 

Figure 6188: PR059868 

Figure 6189A-B: DNA327043, XM.032930, 

gen.XM_032930 

Figure 6190: DNA273992, NM.004493, 



gen.NM .004493 

Figure 6191: PR061938 

Figure 6192A-B: DNA327044, XM.050403, 

gen.XM.050403 

Figure 6193: PR083343 

Figure 6194: DNA327045, XM.029187 

gen.XM.029187 

Figure 6195: PR083344 

Figure 6196: DNA327046,XM.013060, 

gen.XM.013060 

Figure 6197: DNA227943.NM .006787, 

gen.NM .006787 

Figure 6198: PRO38406 

Figure 6199: DNA327047,NM .014481, 

gen.NM .014481 

Figure 6200: PR083345 

Figure 6201: DNA327048,XM.034935, 

gen.XM.034935 

Figure 6202: PR083346 

Figure 6203: DNA327049,XM .084287, 

gen.XM.084287 

Figure 6204: DNA327050, NM .007268, 

gen.NM .007268 

Figure 6205: PRO34043 

Figure 6206: DNA327051,XMj015516, 

gen.XM_015516 

Figure 6207 A-B: DNA327052, XM .013042, 

gen.XM .013042 

Figure 6208: PR083349 

Figure 6209: DNA327053, XM .088630, 

gen.XM.088630 

Figure 6210: DNA327054,NM_031206, 

gen.NM .031206 

Figure 6211: PR083351 

Figure 6212: DNA327055,XM .093050, 

gen.XM .093050 

Figure 6213: PR083352 

Figure 6214A-B: DNA225721,NM_018977, 

gen.NM .018977 

Figure 6215: PR036184 

Figure 6216: DNA327056, XM .010141, 

gen.XM.010141 

Figure 6217: PRO38021 

Figure 6218: DNA327057, XM .088689, 

gen.XM.088689 

Figure 6219: PR083353 

Figure 6220: DNA327058, XM .088688, 

gen.XM.088688 

Figure 6221: PR083354 

Figure 6222: DNA327059,NM_018486, 

gen.NM.018486 

Figure 6223: PR083355 

Figure 6224: DNA327060,NM.001007, 

gen.NM.001007 

Figure 6225: PRO42022 

Figure 6226: DNA327061,XM.093130, 
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genJCM.093130 

Figure 6227: DNA327062, XM.084296, 
gen.XM.084296 

Figure 6228: DNA327063, XM.093241, 
gen.XMJ)93241 

Figure 6229: DNA327064, XM .084283, 
gen.XM.084283 

Figure 6230: DNA273254, NM .000291, 

gen.NM .000291 

Figure 6231: PR061271 

Figure 6232: DNA327065, XM J018142, 

gen.XM .018142 

Figure 6233: DNA327066, XM.030373, 

gen.XM.030373 

Figure 6234: PRO83360 

Figure 6235: DNA327067, XM.165533, 

gen.XM .165533 

Figure 6236: PR083361 

Figure 6237: DNA327068, XM.051476, 

gen.XM.051476 

Figure 6238: DNA327069, XM.051471, 
gen.XM .051471 

Figure 6239: DNA270496, NM.001325, 

gen.NM .001325 

Figure 6240: PR058875 

Figure 6241: DNA327070, XM .033 147, 

gen.XM_033147 

Figure 6242: DNA327071, NM.004085, 

gen.NM .004085 

Figure 6243: PRO59022 

Figure 6244: DNA327072, NM .021029, 

gen.NMj021029 

Figure 6245: PRO10723 

Figure 6246: DNA327073, NM.012286, 

gen.NM .012286 

Figure 6247: PR083365 

Figure 6248: DNA327074, NM .024863, 

gen.NM.024863 

Figure 6249: PR083366 

Figure 6250: DNA327075, XM .043643, 

gen.XM.043643 

Figure 6251: DNA327076.NM .052936, 

gen.NM.052936 

Figure 6252: PR083368 

Figure 6253: DNA327077, XM.088710, 

gen.XM .088710 

Figure 6254: PROS3369 

Figure 6255: DNA327078, XM .166081, 

gen.XM .166081 

Figure 6256: DNA327079, XM .096303, 
gen.XM .096303 

Figure 6257: DNA254785, NM .032227, 

gen.NM .032227 

Figure 6258: PR049883 

Figure 6259: DNA327080, XM.115923, 

gen.XM.l 15923 



Figure 6260: PR083372 

Figure 6261: DNA327081,XM. 066900, 

gen.XM .066900 

Figure 6262: PR083373 

Figure 6263: DNA327082.XM .104983, 

gen.XM.104983 

Figure 6264: PR083374 

Figure 6265: DNA327083, XM.088736, 

gen.XM .088736 

Figure 6266: PR083375 

Figure 6267: DNA327084, XM .088738, 

gen.XM .088738 

Figure 6268: DNA327085,XM .088739, 
gen.XM J088739 

Figure 6269: DNA327086,XM.010117, 
gen.XM .0101 17 

Figure 6270A-B: DNA76504, NM.001560, 

gen.NMj001560 

Figure 6271: PR02537 

Figure 6272: DNA227 1 8 1,NM .006667, 

gen.NM.006667 

Figure 6273: PR037644 

Figure 6274: DNA327087,XM.010362, 

gen.XMj010362 

Figure 6275: DNA327088,XM.016125, 
gen.XM_016125 

Figure 6276: DNA327089, NM.015129, 

gen.NM .015129 

Figure 6277: PR083381 

Figure 6278: DNA327090, NM -001000, 

gen.NM .001000 

Figure 6279: PRO10935 

Figure 6280: DNA327091, XM.010436, 

gen.XM.010436 

Figure 6281: DNA327092,XM .115874, 
gen.XM.115874 

Figure 6282: DNA327093,XM.029461, 

gen.XM .029461 

Figure 6283: PR083383 

Figure 6284: DNA327094, XM .017930, 

gen.XM .017930 

Figure 6285: DNA22765 6, NM .004208, 

gen.NM.004208 

Figure 6286: PR038119 

Figure 6287: DNA273487, NM .004794, 

gen.NM_004794 

Figure 6288: PRO61470 

Figure 6289: DNA327095,XM.088745, 

gen.XM.088745 

Figure 6290: PR083385 

Figure 6291: DNA327096, XM.l 14708, 

gen.XM. 114708 

Figure 6292: PR083386 

Figure 6293: DNA327097, NM.016267, 

gen.NM .016267 

Figure 6294: PR083387 
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Figure 6295A-B: DNA327098, XM .042963, 

gen.XM .042963 

Figure 6296: PR083388 

Figure 6297: DNA327099, XM .042968, 

gen.XM.042968 

Figure 6298: PR083389 

Figure 6299: DNA327100, XM.093219, 

gen.XM.093219 

Figure 6300: DNA327101, NM.016249, 

gen.NMJ)16249 

Figure 6301: PR083391 

Figure 6302: DNA327102, XM.098995, 

gen.XM .098995 

Figure 6303: PR083392 

Figure 6304: DNA327103, XM .041921, 

gen.XM.041921 

Figure 6305: PR083393 

Figure 6306: DNA327104, XM .048905, 

gen.XM .048905 

Figure 6307: PR083394 

Figure 6308: DNA327105, NM.005364, 

gen.NM .005364 

Figure 6309: PR083395 

Figure 6310: DNA327106, XM.010178, 

gen.XM_010178 

Figure 6311: DNA327107, XM.088592, 

gen.XM.088592 

Figure 6312: PR025245 

Figure 6313: DNA327108, XM.018108, 

gen.XM .018108 

Figure 6314: PR083397 

Figure 6315: DNA327109, XM.018109, 

gen.XM.018109 

Figure 6316: DNA3271 10, NM.005362, 

gen.NM .005362 

Figure 6317: PRO24021 

Figure 6318: DNA254783, NM .001363, 

gen.NM.001363 

Figure 6319: PR049881 

Figure 6320: DNA3271 1 1, XM .049337, 

gen.XM .049337 

Figure 6321: DNA227917, NM .019848, 

gen.NM.019848 

Figure 6322: PRO38380 

Figure 6323: DNA3271 12, NM.004699, 

gen.NM.004699 

Figure 6324: PRO83400 

Figure 6325: DNA3271 13, XM .048420, 

gen.XM .048420 

Figure 6326: DNA3271 14, NM.006013, 



gen.NM.006013 

Figure 6327: PR062466 

Figure 6328: DNA3271 15, XM .048410, 

genJCM.048410 

Figure 6329A-C: DNA3271 16, XM.048404, 
gen.XM .048404 

Figure 6330A-C: DNA3271 17, NM .004992, 

gen.NM .004992 

Figure 6331: PRO83403 

Figure 6332: DNA227013, NM .001569, 

gen.NM.001569 

Figure 6333: PR037476 

Figure 6334A-B: DNA225800, NM .000425, 

gen.NM .000425 

Figure 6335: PR036263 

Figure 6336A-B: DNA3271 18, NM .024003, 

gen.NM .024003 

Figure 6337: PRO83404 

Figure 6338: DNA225655, NM.006280, 

gen.NM .006280 

Figure 6339: PR036118 

Figure 6340: DNA276159, NM .004135, 

gen.NM .004135 

Figure 6341: PR063299 

Figure 6342A-B: DNA230792, NM.000033, 

gen.NM.000033 

Figure 6343: PRO38730 

Figure 6344: DNA103558, NM .005745, 

gen.NM_005745 

Figure 6345: PR04885 

Figure 6346: DNA327119,XM_042155, 

gen.XM .042155 

Figure 6347: PRO83405 

Figure 6348: DNA327120, XM.042153, 

gen.XM_042153 

Figure 6349: DNA327121,XM.117555, 
gen.XM.l 17555 

Figure 6350: DNA327122, XM .08431 1, 
gen.XM.084311 

Figure 6351: DNA327 1 23, XM. 033232, 
gen.XM.033232 

Figure 6352: DNA327124,XM .117539, 
gen.XM.l 17539 

Figure 6353: DNA3271 25, XM .027952, 
gen.XM.027952 

Figure 6354: DNA327126, XM.l 14692, 
gen.XM.l 14692 

Figure 6355A-B: DNA327 127, XM.l 65530, 
gen.XM_165530 
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DNA Index (to Figure number) 



DNAO, 1188 

DNA103214,218 

DNA103217,649 

DNA103239,5576 

DNA103253, 188 

DNA103320.5272 

DNA103380, 1677 

DNA103401,4708 

DNA103421,2982 

DNA103436,457 

DNA103462, 5994 

DNA103471.2070 

DNA103474, 3313 

DNA1G3486, 5844 

DNA103505, 1149 

DNA103506.2990 

DNA103509,4110 

DNA103514, 3478 

DNA103525,5774 

DNA103558, 6344 

DNA103580, 5494 

DNA103588, 2274 

DNA103593,711 

DNA129504,4985 

DNA131588.2593 

DNA137231,3667 

DNA139747, 1368 

DNA144601,3051 

DNA150457.4936 

DNA150485.4305 

DNA150548.5703 

DNA150562, 1153 

DNA150679, 1732 

DNA150725,806 

DNA150767,572i 

DNA150772,2034 

DNA150784,5502 

DNA150814,4953 

DNA150884, 1024 

DNA150974, 3204 

DNA150976,1145 

DNA150978,3520 

DNA150997, 3526 

DNA151010,2546 

DNA151017, 1066 

DNA151148,44 

DNA151752,6020 

DNA151808.5476 

DNA151827, 3466 

DNA151831.4141 

DNA151882, 6005 

DNA1S1893.4079 

DNA151898.5896 



DNA171408.48 
DNA188229, 5836 
DNA188351,4782 
DNA188396, 3480 
DNA188732,5882 
DNA1 88740, 6027 
DNA188748, 146 
DNA189315, 167 
DNA1 89687, 3297 
DNA189697, 998 
DNA189703,4568 
DNA193882,585 
DNA193955,2193 
DNA193957.2947 
DNA194600.428 
DNA194701, 5747 
DNA194740, 854 
DNA194805,4530 
DNA194807,5760 
DNA194827.977 
DNA196344.576 
DNA196349, 124 
DNA19635 1,3600 
DNA196642,4877 
DNA210134.367 
DNA210180,3962 
DNA218271,5258 
DNA218841,2782 
DNA219225, 6075 
DNA219233,4182 
DNA225584, 1489 
DNA225592, 1330 
DNA225630, 2767 
DNA225631,2174 
DNA225632, 3473 
DNA225649,4042 
DNA225655,6338 
DNA225671,2506 
DNA225721.6214 
DNA225752, 3376 
DNA225800, 6334 
DNA225809, 356 
DNA225865, 3976 
DNA225909, 1828 
DNA225910, 1128 
DNA225919, 1446 
DNA225920, 1511 
DNA225921,1515 
DNA225954, 5947 
DNA226005, 553 
DNA22601 1,5517 
DNA226014, 3729 
DNA226028, 3489 
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DNA226080, 3206 

DNA226105.3992 

DNA226125.409 

DNA226217,3004 

DNA226260,271 

DNA226262, 105 

DNA226324,4095 

DNA226337.2458 

DNA226345,2670 

DNA226389, 4820 

DNA226409, 5921 

DNA226416,2262 

DNA226418, 1791 

DNA226428,741 

DNA226496,2565 

DNA226547,1108 

DNA226560, 2393 

DNA226561,5956 

DNA226617, 5935 

DNA226619,474 

DNA226646.4224 

DNA226758, 5745 

DNA226771,3498 

DNA226793,436 

DNA226853.3866 

DNA226872, 1689 

DNA227013.6332 

DNA227055,4939 

DNA227071,4889 

DNA227084,4742 

DNA227088, 3220 

DNA227092, 3593 

DNA227094, 3628 

DNA227165,684 

DNA227171,3724 

DNA227 172, 2964 

DNA227173, 1573 

DNA227181,6272 

DNA227190.814 

DNA227191.3588 

DNA227204, 1886 

DNA227206,4170 

DNA227213, 157 

DNA227234,4626 

DNA227246, 550 

DNA227249, 5352 

DNA227267, 2512 

DNA227268.2242 

DNA227280, 5232 

DNA227307, 1165 

DNA227320, 1812 

DNA22732 1,3984 

DNA227348,5681 

DNA227442, 1942 

DNA227472,5771 

DNA227474, 3720 



DNA227491.2691 
DNA227504.594 
DNA227509,3076 
DNA227528.803 
DNA227529, 346 
DNA227545, 698 
DNA227559,4161 
DNA227575, 1508 
DNA227577, 374 
DNA227607, 1961 
DNA227656,6285 
DNA227689.6147 
DNA227764,4891 
DNA227795,792 
DNA227821,36 
DNA227873,4841 
DNA227917, 6321 
DNA227924, 2099 
DNA227929,2206 
DNA227943,6197 
DNA230792, 6342 
DNA234442,4214 
DNA23793 1,6104 
DNA238039,6181 
DNA247474,578 
DNA247595,2182 
DNA251057,5515 
DNA252367, 1081 
DNA253804, 1370 
DNA254141,6003 
DNA254147, 1627 
DNA254165, 6068 
DNA254186, 3329 
DNA254198,4719 
DNA254204, 994 
DNA254240, 6045 
DNA254298, 499 
DNA254346, 603 
DNA254532,4487 
DNA254543,2740 
DNA254548, 5627 
DNA254572, 5885 
DNA254582, 1155 
DNA254620, 1316 
DNA254624, 3468 
DNA254771,2693 
DNA254777, 3777 
DNA254781,4374 
DNA254783, 6318 
DNA254785, 6257 
DNA254791,4898 
DNA254994, 5890 
DNA255046,5939 
DNA255078,3113 
DNA255340,4208 
DNA255370, 4265 
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DNA255414,4747 
DNA255531,859 
DNA255696,3109 
DNA256070.6101 
DNA256072.3511 
DNA256503, 199 
DNA256533.5513 
DNA256555.5146 
DNA256813, 5056 
DNA256836, 5387 
DNA256840, 5434 
DNA256844,4362 
DNA256886, 4370 
DNA256905, 545 
DNA257253, 1642 
DNA257309.2746 
DNA257428,4854 
DNA257511, 1437 
DNA25753 1,5506 
DNA257549.5965 
DNA257916,402 
DNA257965.3415 
DNA269431,3101 
DNA269481,5593 
DNA269498, 4059 
DNA269526, 5814 
DNA269593, 1854 
DNA269630, 5312 
DNA269708,267 
DNA269730,1195 
DNA269746, 5873 
DNA269793,6126 
DNA269803,3284 
DNA269809, 1687 
DNA269816, 1646 
DNA269830, 5989 
DNA269858, 1270 
DNA269894,5298 
DNA269910, 1062 
DNA269930, 1097 
DNA269952, 3093 
DNA270015,3864 
DNA270134,3208 
DNA270154.746 
DNA270254, 3896 
DNA270315, 5206 
DNA270401, 1099 
DNA270458,3591 
DNA270496,6239 
DNA270613, 1892 
DNA270615, 1386 
DNA270621, 5234 
DNA270675, 1850 
DNA270677, 3823 
DNA270697.6011 
DNA270711,2371 



DNA270721.3295 
DNA270901,4879 
DNA27093 1,5504 
DNA270954, 6079 
DNA270975.4843 
DNA270979,4805 
DNA27099 1,2662 
DNA271003,288 
DNA271010,5676 
DNA271040, 1997 
DNA271060, 751 
DNA271171,4507 
DNA271187, 1093 
DNA271243, 703 
DNA271324, 3380 
DNA271344, 3550 
DNA271418,2104 
DNA271492, 3727 
DNA271580,6187 
DNA271608, 934 
DNA271626, 1721 
DNA271722, 2751 
DNA271841,5052 
DNA271843,3392 
DNA27 1847, 2660 
DNA271931, 1697 
DNA271986.519 
DNA272024, 202 
DNA272050, 2600 
DNA272062,5625 
DNA272090,2348 
DNA272127,881 
DNA272171, 1866 
DNA272213,2734 
DNA272263, 1967 
DNA272347, 5426 
DNA272379, 3555 
DNA272413, 3390 
DNA272421,5201 
DNA272605, 1335 
DNA272655, 2714 
DNA272728, 3215 
DNA272748.235 
DNA272889,4812 
DNA273014,4267 
DNA273060, 194 
DNA273066, 5568 
DNA273088, 396 
DNA273254, 6230 
DNA273320, 5785 
DNA273346, 5615 
DNA273474,5421 
DNA273487, 6287 
DNA273517,5738 
DNA273521,3066 
DNA273600, 5448 
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DNA273694, 5023 
DNA273712,42 
DNA273759.2899 
DNA273800, 689 
DNA273839.4360 
DNA273865,2246 
DNA273919, 1182 
DNA273992, 6190 
DNA274002, 4476 
DNA274034, 5277 
DNA274058,3912 
DNA274101.5115 
DNA274129, 5892 
DNA274139, 5441 
DNA274178, 2491 
DNA274180,4516 
DNA274206, 1830 
DNA274289, 5523 
DNA274326.2176 
DNA274361,3763 
DNA274487, 180 
DNA274690, 5039 
DNA274745, 192 
DNA274755,4975 
DNA274759, 340 
DNA274761.5199 
DNA274823, 5542 
DNA274829, 6149 
DNA275049,662 
DNA275066,744 
DNA275139, 292 
DNA275144,4300 
DNA275181.4320 
DNA275195.651 
DNA275240, 864 
DNA275322, 2723 
DNA275334, 2232 
DNA275408,4564 
DNA275630, 1904 
DNA276159, 6340 
DNA281436,3900 
DNA287167, 794 
DNA287173.31 
DNA287 189, 2265 
DNA287216,2701 
DNA287227, 1952 
DNA287234, 5014 
DNA287237, 3008 
DNA287240, 5328 
DNA287243, 5279 
DNA287246, 1900 
DNA287254.3236 
DNA287261.5668 
DNA287270,5654 
DNA28727 1,2763 
DNA287282, 1582 



DNA287290,5685 
DNA287291,4919 
DNA287319, 1969 
DNA28733 1,4242 
DNA287355.4520 
DNA287417, 3218 
DNA287425,4900 
DNA287427,4778 
DNA287636.5154 
DNA287642.2951 
DNA288247, 2703 
DNA288259, 1598 
DNA289522,4446 
DNA289530,2761 
DNA290231,1638 
DNA290234,540 
DNA290259, 6034 
DNA290260,5550 
DNA290264, 2007 
DNA290284, 350 
DNA290292, 4728 
DNA290294, 3620 
DNA290319,2680 
DNA290585, 1459 
DNA290785,2032 
DNA294794,438 
DNA297288, 5638 
DNA297388.4699 
DNA297398,3434 
DNA299899, 930 
DNA302016,3827 
DNA302020, 1718 
DNA304459, 2986 
DNA304460, 1908 
DNA304488, 2996 
DNA304658, 5562 
DNA304661, 1946 
DNA304662, 5640 
DNA304666,369 
DNA304668, 1963 
DNA304669, 3887 
DNA304670, 5830 
DNA304680, 1874 
DNA304685,2435 
DNA304686,220 
DNA304694, 3717 
DNA304699, 1986 
DNA304704.4575 
DNA304707, 2254 
DNA304710, 2308 
DNA304715,4714 
DNA304716, 1912 
DNA304719, 6038 
DNA304720, 371 
DNA304783, 3631 
DNA304801,2342 
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DNA304805, 905 
DNA304835,5973 
DNA323717, 1 
DNA323718.2 
DNA323719,3 
DNA323720,4 
DNA323721.6 
DNA323722, 8 
DNA323723, 10 
DNA323724, 12 
DNA323725, 14 
DNA323726, 15 
DNA323727, 17 
DNA323728, 19 
DNA323729, 20 
DNA323730, 22 
DNA323731.24 
DNA323732,26 
DNA323733, 28 
DNA323734.29 
DNA323735, 33 
DNA323736,34 
DNA323737, 38 
DNA323738,40 
DNA323739,41 
DNA323740,46 
DNA323741,50 
DNA323742, 52 
DNA323743, 54 
DNA323744, 55 
DNA323745, 57 
DNA323746, 58 
DNA323747, 59 
DNA323748,60 
DNA323749, 62 
DNA323750, 64 
DNA323751,66 
DNA323752, 67 
DNA323753, 68 
DNA323754, 69 
DNA323755.71 
DNA323756,73 
DNA323757,75 
DNA323758, 76 
DNA323759, 77 
DNA323760,78 
DNA323761,79 
DNA323762,81 
DNA323763, 83 
DNA323764, 85 
DNA323765, 87 
DNA323766, 89 
DNA323767,91 
DNA323768, 93 
DNA323769, 95 
DNA323770,97 



DNA323771,98 
DNA323772.99 
DNA323773, 101 
DNA323774, 102 
DNA323775, 103 
DNA323776, 107 
DNA323777, 109 
DNA323778, 110 
DNA323779,112 
DNA323780, 113 
DNA32378 1,114 
DNA323782.116 
DNA323783,118 
DNA323784, 120 
DNA323785, 122 
DNA323788, 126 
DNA323789, 127 
DNA323790, 129 
DNA323791, 130 
DNA323792, 131 
DNA323793, 133 
DNA323794, 134 
DNA323795, 135 
DNA323796, 136 
DNA323797, 137 
DNA323798, 139 
DNA323799, 140 
DNA323800, 141 
DNA323801, 142 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
L Definitions 

The terms "TAT polypeptide" and "TAT" as used herein and when immediately followed by a 
numerical designation, refer to various polypeptides, wherein the complete designation (i.e. ,TAT/number) refers 
to specific polypeptide sequences as described herein. The terms a TAT/number polypeptide" and 
5 "TAT/number" wherein the term "number" is provided as an actual numerical designation as used herein 

encompass native sequence polypeptides, polypeptide variants and fragments of native sequence polypeptides 
and polypeptide variants (which are further defined herein). The TAT polypeptides described herein may be 
isolated from a variety of sources, such as from human tissue types or from another source, or prepared by 
recombinant or synthetic methods. The term "TAT polypeptide" refers to each individual TAT/number 

10 polypeptide disclosed herein. All disclosures in this specification which refer to the "TAT polypeptide" refer 
to each of the polypeptides individually as well as jointly. For example, descriptions of the preparation of, 
purification of, derivation of, formation of antibodies to or against, formation of TAT binding oligopeptides to 
or against, formation of TAT binding organic molecules to or against, administration of, compositions 
containing, treatment of a disease with, etc. , pertain to each polypeptide of the invention individually. The term 

1 5 "TAT polypeptide" also includes variants of the TAT/number polypeptides disclosed herein. 

A "native sequence TAT polypeptide" comprises a polypeptide having the same amino acid sequence 
as the corresponding TAT polypeptide derived from nature. Such native sequence TAT polypeptides can be 
isolated from nature or can be produced by recombinant or synthetic means. The term "native sequence TAT 
polypeptide" specifically encompasses naturally-occurring truncated or secreted forms of the specific TAT 

20 polypeptide (e.g., an extracellular domain sequence), naturally-occurring variant forms (e.g., alternatively 
spliced forms) and naturally-occurring allelic variants of the polypeptide. In certain embodiments of the 
invention, the native sequence TAT polypeptides disclosed herein are mature or full-length native sequence 
polypeptides comprising the full-length amino acids sequences shown in the accompanying figures. Start and 
stop codons (if indicated) are shown in bold font and underlined in the figures. Nucleic acid residues indicated 

25 as "N" in the accompanying figures are any nucleic acid residue. However, while the TAT polypeptides 

disclosed in the accompanying figures are shown to begin with methionine residues designated herein as amino 
acid position 1 in the figures, it is conceivable and possible that other methionine residues located either 
upstream or downstream from the amino acid position 1 in the figures may be employed as the starting amino 
acid residue for the TAT polypeptides. 

30 The TAT polypeptide "extracellular domain" or "ECD" refers to a form of the TAT polypeptide which 

is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a TAT polypeptide ECD will 
have less than 1 % of such transmembrane and/or cytoplasmic domains and preferably, will have less than 0.5 % 
of such domains. It will be understood that any transmembrane domains identified for the TAT polypeptides 
of the present invention are identified pursuant to criteria routinely employed in the art for identifying that type 

35 of hydrophobic domain. The exact boundaries of a transmembrane domain may vary but most likely by no more 
than about 5 amino acids at either end of the domain as initially identified herein. Optionally, therefore, an 
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extracellular domain of a TAT polypeptide may contain from about 5 or fewer amino acids on either side of the 
transmembrane domain/extracellular domain boundary as identified in the Examples or specification and such 
polypeptides, with or without the associated signal peptide, and nucleic acid encoding them, are contemplated 
by the present invention. 

The approximate location of the "signal peptides" of the various TAT polypeptides disclosed herein 
may be shown in the present specification and/or the accompanying figures. It is noted, however, that the C- 
terminal boundary of a signal peptide may vary, but most likely by no more than about 5 amino acids on either 
side of the signal peptide C-terminal boundary as initially identified herein, wherein the C-terminal boundary 
of the signal peptide may be identified pursuant to criteria routinely employed in the art for identifying that type 
of amino acid sequence element (e.g., Nielsen et al., Prot. Eng. 10:1-6 (1997) and von Heinje et al., Nucl. 
Acids. Res. 14:4683-4690 (1986)). Moreover, it is also recognized that, in some cases, cleavage of a signal 
sequence from a secreted polypeptide is not entirely uniform, resulting in more than one secreted species. These 
mature polypeptides, where the signal peptide is cleaved within no more than about 5 amino acids on either side 
of the C-terminal boundary of the signal peptide as identified herein, and the polynucleotides encoding them, 
are contemplated by the present invention. 

TAT polypeptide variant" means a TAT polypeptide, preferably an active TAT polypeptide, as defined 
herein having at least about 80% amino acid sequence identity with a full-length native sequence TAT 
polypeptide sequence as disclosed herein, a TAT polypeptide sequence lacking the signal peptide as disclosed 
herein, an extracellular domain of a TAT polypeptide, with or without the signal peptide, as disclosed herein 
or any other fragment of a full-length TAT polypeptide sequence as disclosed herein (such as those encoded by 
a nucleic acid that represents only a portion of the complete coding sequence for a full-length TAT polypeptide) . 
Such TAT polypeptide variants include, for instance, TAT polypeptides wherein one or more amino acid 
residues are added, or deleted, at the N- or C-terminus of the full-length native amino acid sequence. 
Ordinarily, a TAT polypeptide variant will have at least about 80% amino acid sequence identity, alternatively 
at least about 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 
97%, 98%, or 99% amino acid sequence identity, to a full-length native sequence TAT polypeptide sequence 
as disclosed herein, a TAT polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular 
domain of a TAT polypeptide, with or without the signal peptide, as disclosed herein or any other specifically 
defined fragment of a full-length TAT polypeptide sequence as disclosed herein. Ordinarily, TAT variant 
polypeptides are at least about 10 amino acids in length, alternatively at least about 20, 30, 40, 50, 60, 70, 80, 
90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 
310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 
520, 530, 540, 550, 560, 570, 580, 590, 600 amino acids in length, or more. Optionally, TAT variant 
polypeptides will have no more than one conservative amino acid substitution as compared to the native TAT 
polypeptide sequence, alternatively no more than 2, 3, 4, 5, 6, 7, 8, 9, or 10 conservative amino acid 
substitution as compared to the native TAT polypeptide sequence. 

"Percent (%) amino acid sequence identity" with respect to the TAT polypeptide sequences identified 
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herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the 
amino acid residues in the specific TAT polypeptide sequence, after aligning the sequences and introducing gaps, 
if necessary, to achieve the maximum percent sequence identity, and not considering any conservative 
substitutions as part of the sequence identity. Alignment for purposes of detemnning percent amino acid 
sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly 
5 available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Those 
skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms 
needed to achieve maximal alignment over the full length of the sequences being compared. For purposes 
herein, however, % amino acid sequence identity values are generated using the sequence comparison computer 
program ALIGN-2, wherein the complete source code for the ALIGN-2 program is provided in Table 1 below. 

10 The ALIGN-2 sequence comparison computer program was authored by Genentech, Inc. and the source code 
shown in Table 1 below has been filed with user documentation in the U.S. Copyright Office, Washington D .C. , 
20559, where it is registered under U.S. Copyright Registration No. TXU510087. The ALIGN-2 program is 
publicly available through Genentech, Inc., South San Francisco, California or may be compiled from the source 
code provided in Table 1 below. The ALIGN-2 program should be compiled for use on a UNIX operating 

1 5 system, preferably digital UNIX V4.0D. All sequence comparison parameters are set by the ALIGN-2 
program and do not vary. 

In situations where ALIGN-2 is employed for amino acid sequence comparisons, the % amino acid 
sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B (which 
can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid 
20 sequence identity to, with, or against a given amino acid sequence B) is calculated as follows: 

100 times the fraction X/Y 

where X is the number of amino acid residues scored as identical matches by the sequence alignment program 
25 ALIGN-2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in 
B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid 
sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity 
of B to A. As examples of % amino acid sequence identity calculations using this method, Tables 2 and 3 
demonstrate how to calculate the % amino acid sequence identity of the amino acid sequence designated 
30 "ComparisonProtein" to the amino acid sequence designated "TAT", wherein "TAT" represents the amino acid 
sequence of a hypothetical TAT polypeptide of interest, "Comparison Protein" represents the amino acid 
sequence of a polypeptide against which the "TAT" polypeptide of interest is being compared, and "X, " Y" and 
"Z" each represent different hypothetical amino acid residues. Unless specifically stated otherwise, all % amino 
acid sequence identity values used herein are obtained as described in the immediately preceding paragraph using 
35 the ALIGN-2 computer program. 

"TAT variant polynucleotide" or "TAT variant nucleic acid sequence" means a nucleic acid molecule 
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which encodes a TAT polypeptide, preferably an active TAT polypeptide, as defined herein and which has at 
least about 80% nucleic acid sequence identity with a nucleotide acid sequence encoding a full-length native 
sequence TAT polypeptide sequence as disclosed herein, a full-length native sequence TAT polypeptide sequence 
lacking the signal peptide as disclosed herein, an extracellular domain of a TAT polypeptide, with or without 
the signal peptide, as disclosed herein or any other fragment of a full-length TAT polypeptide sequence as 
disclosed herein (such as those encoded by a nucleic acid that represents only a portion of the complete coding 
sequence for a full-length TAT polypeptide). Ordinarily, a TAT variant polynucleotide will have at least about 
80% nucleic acid sequence identity, alternatively at least about 81 %, 82%, 83 %, 84%, 85%, 86%, 87% 88% 
89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% nucleic acid sequence identity with a 
nucleic acid sequence encoding a full-length native sequence TAT polypeptide sequence as disclosed herein, a 
full-length native sequence TAT polypeptide sequence lacking the signal peptide as disclosed herein, an 
extracellular domain of a TAT polypeptide, with or without the signal sequence, as disclosed herein or any other 
fragment of a full-length TAT polypeptide sequence as disclosed herein. Variants do not encompass the native 
nucleotide sequence. 

Ordinarily, TAT variant polynucleotides are at least about 5 nucleotides in length, alternatively at least 
about 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22. 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 
50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 
170, 175, 180, 185, 190, 195, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 34 0 ' 
350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 55 0 ' 
560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760,' 
770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970,' 
980, 990, or 1000 nucleotides in length, wherein in this context the term "about" means the referenced 
nucleotide sequence length plus or minus 10% of that referenced length. 

"Percent (%) nucleic acid sequence identity" with respect to TAT-encoding nucleic acid sequences 
identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with the 
nucleotides in the TAT nucleic acid sequence of interest, after aligning the sequences and introducing gaps, if 
necessary, to achieve the maximum percent sequence identity. Alignment for purposes of detenmning percent 
nucleic acid sequence identity can be achieved in various ways that are within the skill in the art, for instance, 
using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) 
software. For purposes herein, however, % nucleic acid sequence identity values are generated using the 
sequence comparison computer program ALIGN-2, wherein the complete source code for the ALIGN-2 
program is provided in Table 1 below. The ALIGN-2 sequence comparison computer program was authored 
by Genentech, Inc. and the source code shown in Table 1 below has been filed with user documentation in the 
U.S. Copyright Office, Washington D.C., 20559, where it is registered under U.S. Copyright Registration No. 
TXU510087. The ALIGN-2 program is publicly available through Genentech, Inc., South San Francisco, 
California or may be compiled from the source code provided in Table 1 below. The ALIGN-2 program should 
be compiled for use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence comparison 
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parameters are set by the ALIGN-2 program and do not vary. 

In situations where ALIGN-2 is employed for nucleic acid sequence comparisons, the % nucleic acid 
sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which 
can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic acid 
sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 

5 

100 times the fraction W/Z 

where W is the number of nucleotides scored as identical matches by the sequence alignment program ALIGN-2 
in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will be 

10 appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence 
D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to 
C. As examples of % nucleic acid sequence identity calculations, Tables 4 and 5, demonstrate how to calculate 
the % nucleic acid sequence identity of the nucleic acid sequence designated "Comparison DNA" to the nucleic 
acid sequence designated "TAT-DNA", wherein "TAT-DNA" represents a hypothetical TAT-encoding nucleic 

15 acid sequence of interest, "Comparison DNA" represents the nucleotide sequence of a nucleic acid molecule 
against which the "TAT-DNA" nucleic acid molecule of interest is being compared, and "N", "L" and "V" each 
represent different hypothetical nucleotides. Unless specifically stated otherwise, all % nucleic acid sequence 
identity values used herein are obtained as described in the immediately preceding paragraph using the ALIGN-2 
computer program. 

20 In other embodiments, TAT variant polynucleotides are nucleic acid molecules that encode a TAT 

polypeptide and which are capable of hybridizing, preferably under stringent hybridization and wash conditions, 
to nucleotide sequences encoding a full-length TAT polypeptide as disclosed herein. TAT variant polypeptides 
may be those that are encoded by a TAT variant polynucleotide. 

The term "full-length coding region" when used in reference to a nucleic acid encoding a TAT 

25 polypeptide refers to the sequence of nucleotides which encode the full-length TAT polypeptide of the invention 
(which is often shown between start and stop codons, inclusive thereof, in the accompanying figures). The term 
"full-length coding region" when used in reference to an ATCC deposited nucleic acid refers to the TAT 
polypeptide-encoding portion of the cDNA that is inserted into the vector deposited with the ATCC (which is 
often shown between start and stop codons, inclusive thereof, in the accompanying figures). 

30 "Isolated," when used to describe the various TAT polypeptides disclosed herein, means polypeptide 

that has been identified and separated and/or recovered from a component of its natural environment. 
Contaminant components of its natural environment are materials that would typically interfere with diagnostic 
or therapeutic uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non- 
proteinaceous solutes. In preferred embodiments, the polypeptide will be purified (1) to a degree sufficient to 

35 obtain at least 15 residues of N-tenninal or internal amino acid sequence by use of a spinning cup sequenator, 
or (2) to homogeneity by SDS-PAGE under non-reducing or reducing conditions using Coomassie blue or, 
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preferably, silver stain. Isolated polypeptide includes polypeptides situ within recombinant cells, since at least 
one component of the TAT polypeptide natural environment will not be present. Ordinarily, however, isolated 
polypeptide will be prepared by at least one purification step. 

An "isolated" TAT polypeptide-encoding nucleic acid or other polypeptide-encoding nucleic acid is a 
nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule with 
5 which it is ordinarily associated in the natural source of the polypeptide-encoding nucleic acid. An isolated 
polypeptide-encoding nucleic acid molecule is other than in the form or setting in which it is found in nature. 
Isolated polypeptide-encoding nucleic acid molecules therefore are distinguished from the specific polypeptide- 
encoding nucleic acid molecule as it exists in natural cells. However, an isolated polypeptide-encoding nucleic 
acid molecule includes polypeptide-encoding nucleic acid molecules contained in cells that ordinarily express 
10 the polypeptide where, for example, the nucleic acid molecule is in a chromosomal location different from that 
of natural cells. 

The term "control sequences" refers to DNA sequences necessary for the expression of an operably 
linked coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, 
for example, include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells 

15 are known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic 
acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a 
polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or 
enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome 

20 binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, 
"operably linked" means that the DNA sequences being linked are contiguous, and, in the case of a secretory 
leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is 
accomplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide 
adaptors or linkers are used in accordance with conventional practice. 

25 "Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and 

generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration. 
In general, longer probes require higher temperatures for proper annealing, while shorter probes need lower 
temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when 
complementary strands are present in an environment below their melting temperature. The higher the degree 

30 of desired homology between the probe and hybridizable sequence, the higher the relative temperature which 
can be used. As a result, it follows that higher relative temperatures would tend to make the reaction conditions 
more stringent, while lower temperatures less so. For additional details and explanation of stringency of 
hybridization reactions, see Ausubel et aL, Current Protocols in Molecular Biology, Wiley Interscience 
Publishers, (1995). 

35 "Stringent conditions" or "high stringency conditions", as defined herein, may be identified by those 

that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium 
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chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a 
denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum 
albumin/0. 1% Ficoll/0.1% polyvinylpyrrolidone/50mM sodium phosphate buffer at pH 6.5 with 750 mM 
sodium chloride, 75 mM sodium citrate at 42°C; or (3) overnight hybridization in a solution that employs 50% 
formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0. 1 % sodium 
pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 jig/ml), 0. 1 % SDS, and 10% dextran 
sulfate at 42°C, with a 10 minute wash at 42°C in 0.2 x SSC (sodium chloride/sodium citrate) followed by a 
10 minute high-stringency wash consisting of 0. 1 x SSC containing EDTA at 55 °C. 

"Moderately stringent conditions" may be identified as described by Sambrook et al., Molecular 
Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989, and include the use of washing 
solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less stringent that those 
described above. An example of moderately stringent conditions is overnight incubation at 37°Cin a solution 
comprising: 20% formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate 
(pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA, 
followed by washing the filters in 1 x SSC at about 37-50°C. The skilled artisan will recognize how to adjust 
the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like. 

The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a TAT 
polypeptide or anti-TAT antibody fused to a "tag polypeptide". The tag polypeptide has enough residues to 
provide an epitope against which an antibody can be made, yet is short enough such that it does not interfere 
with activity of the polypeptide to which it is fused. The tag polypeptide preferably also is fairly unique so that 
the antibody does not substantially cross-react with other epitopes. Suitable tag polypeptides generally have at 
least six amino acid residues and usually between about 8 and 50 amino acid residues (preferably, between about 
10 and 20 amino acid residues). 

"Active" or "activity" for the purposes herein refers to form(s) of a TAT polypeptide which retain a 
biological and/or an immunological activity of native or naturally-occurring TAT, wherein "biological" activity 
refers to a biological function (either inhibitory or stimulatory) caused by a native or naturally-occurring TAT 
other than the ability to induce the production of an antibody against an antigenic epitope possessed by a native 
or naturally-occurring TAT and an "immunological" activity refers to the ability to induce the production of 
an antibody against an antigenic epitope possessed by a native or naturally-occurring TAT. 

The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully 
blocks, inhibits, or neutralizes a biological activity of a native TAT polypeptide disclosed herein. In a similar 
manner, the term "agonist" is used in the broadest sense and includes any molecule that mimics a biological 
activity of a native TAT polypeptide disclosed herein. Suitable agonist or antagonist molecules specifically 
include agonist or antagonist antibodies or antibody fragments, fragments or amino acid sequence variants of 
native TAT polypeptides, peptides, antisense oligonucleotides, small organic molecules, etc. Methods for 
identifying agonists or antagonists of a TAT polypeptide may comprise contacting a TAT polypeptide with a 
candidate agonist or antagonist molecule and measuring a detectable change in one or more biological activities 
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normally associated with the TAT polypeptide. 

Treating" or "treatment" or "alleviation" refers to both therapeutic treatment and prophylactic or 
preventative measures, wherein the object is to prevent or slow down (lessen) the targeted pathologic condition 
or disorder. Those in need of treatment include those already with the disorder as well as those prone to have 
the disorder or those in whom the disorder is to be prevented. A subject or mammal is successfully "treated" 
for a TAT polypeptide-expressing cancer if, after receiving a therapeutic amount of an anti-TAT antibody, TAT 
binding oligopeptide or TAT binding organic molecule according to the methods of the present invention, the 
patient shows observable and/or measurable reduction in or absence of one or more of the following: reduction 
in the number of cancer cells or absence of the cancer cells; reduction in the tumor size; inhibition (i.e., slow 
to some extent and preferably stop) of cancer cell infiltration into peripheral organs including the spread of 
cancer into soft tissue and bone; inhibition (i.e. , slow to some extent and preferably stop) of tumor metastasis; 
inhibition, to some extent, of tumor growth; and/or relief to some extent, one or more of the symptoms 
associated with the specific cancer; reduced morbidity and mortality, and improvement in quality of life issues. 
To the extent the anti-TAT antibody or TAT binding oligopeptide may prevent growth and/or kill existing cancer 
cells, it may be cytostatic and/or cytotoxic. Reduction of these signs or symptoms may also be felt by the 
patient. 

The above parameters for assessing successful treatment and improvement in the disease are readily 
measurable by routine procedures familiar to a physician. For cancer therapy, efficacy can be measured, for 
example, by assessing the time to disease progression (TTP) and/or detennining the response rate (RR). 
Metastasis can be determined by staging tests and by bone scan and tests for calcium level and other enzymes 
to determine spread to the bone. CT scans can also be done to look for spread to the pelvis and lymph nodes 
in the area. Chest X-rays and measurement of liver enzyme levels by known methods are used to look for 
metastasis to the lungs and liver, respectively. Other routine methods for monitoring the disease include 
transrectal ultrasonography (TRUS) and transrectal needle biopsy (TRNB). 

For bladder cancer, which is a more localized cancer, methods to determine progress of disease include 
urinary cytologic evaluation by cystoscopy, monitoring for presence of blood in the urine, visualization of the 
urolhelial tract by sonography or an intravenous pyelogram, computed tomography (CT) and magnetic resonance 
imaging (MRI). The presence of distant metastases can be assessed by CT of the abdomen, chest x-rays, or 
radionuclide imaging of the skeleton. 

"Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed to 
an acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. 
"Intermittent M administration is treatment that is not consecutively done without interruption, but rather is cyclic 
in nature. 

"Mammal" for purposes of the treatment of, alleviating the symptoms of or diagnosis of a cancer refers 
to any animal classified as a mammal, including humans, domestic and farm animals, and zoo, sports, or pet 
animals, such as dogs, cats, cattle, horses, sheep, pigs, goats, rabbits, etc. Preferably, the mammal is human. 

Administration "in combination with" one or more further therapeutic agents includes simultaneous 
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(concurrent) and consecutive administration in any order. 

"Carriers" as used herein include pharmaceutical^ acceptable carriers, excipients, or stabilizers which 
are nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Often 
the physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically 
acceptable carriers include buffers such as phosphate, citrate, and other organic acids; antioxidants including 
ascorbic acid; low molecular weight (less than about 10 residues) polypeptide; proteins, such as serum albumin, 
gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, 
glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides, and other carbohydrates including 
glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt- 
forming counterions such as sodium; and/or nonionic surfactants such as TWEEN® , polyethylene glycol (PEG), 
and PLURONICS®. 

By "solid phase" or "solid support" is meant a non-aqueous matrix to which an antibody, TAT binding 
oligopeptide or TAT binding organic molecule of the present invention can adhere or attach. Examples of solid 
phases encompassed herein include those formed partially or entirely of glass (e.g., controlled pore glass), 
polysaccharides (e.g., agarose), polyacrylamides, polystyrene, polyvinyl alcohol and silicones. In certain 
embodiments, depending on the context, the solid phase can comprise the well of an assay plate; in others it is 
a purification column (e.g., an affinity chromatography column). This term also includes a discontinuous solid 
phase of discrete particles, such as those described in U.S. Patent No. 4,275,149. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant 
which is useful for delivery of a drug (such as a TAT polypeptide, an antibody thereto or a TAT binding 
oligopeptide) to a mammal. The components of the liposome are commonly arranged in a bilayer formation, 
similar to the lipid arrangement of biological membranes. 

A "small" molecule or "small" organic molecule is defined herein to have a molecular weight below 
about 500 Daltons. 

An "effective amount" of a polypeptide, antibody, TAT binding oligopeptide, TAT binding organic 
molecule or an agonist or antagonist thereof as disclosed herein is an amount sufficient to carry out a specifically 
stated purpose. An "effective amount" may be determined empirically and in a routine manner, in relation to 
the stated purpose. 

The term "therapeutically effective amount" refers to an amount of an antibody, polypeptide, TAT 
binding oligopeptide, TAT binding organic molecule or other drug effective to "treat" a disease or disorder in 
a subject or mammal. In the case of cancer, the therapeutically effective amount of the drug may reduce the 
number of cancer cells; reduce the tumor size; inhibit (i.e., slow to some extent and preferably stop) cancer cell 
infiltration into peripheral organs; inhibit (i.e., slow to some extent and preferably stop) tumor metastasis; 
inhibit, to some extent, tumor growth; and/or relieve to some extent one or more of the symptoms associated 
with the cancer. See the definition herein of "treating". To the extent the drug may prevent growth and/or kill 
existing cancer cells, it may be cytostatic and/or cytotoxic. 

A "growth inhibitory amount" of an anti-TAT antibody, TAT polypeptide, TAT binding oligopeptide 
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or TAT binding organic molecule is an amount capable of inhibiting the growth of a cell, especially tumor, e.g. , 
cancer cell, either in vitro or in vivo. A "growth inhibitory amount" of an anti-TAT antibody, TAT 
polypeptide, TAT binding oligopeptide or TAT binding organic molecule for purposes of inhibiting neoplastic 
cell growth may be determined empirically and in a routine manner. 

A "cytotoxic amount" of an anti-TAT antibody, TAT polypeptide, TAT binding oligopeptide or TAT 
5 binding organic molecule is an amount capable of causing the destruction of a cell, especially tumor, e.g., 
cancer cell, either in vitro or in vivo. A "cytotoxic amount" of an anti-TAT antibody, TAT polypeptide, TAT 
binding oligopeptide or TAT binding organic molecule for purposes of inhibiting neoplastic cell growth may be 
determined empirically and in a routine manner. 

The term "antibody" is used in the broadest sense and specifically covers, for example, single anti-TAT 

10 monoclonal antibodies (including agonist, antagonist, and neutralizing antibodies), anti-TAT antibody 

compositions with polyepitopic specificity, polyclonal antibodies, single chain anti-TAT antibodies, and 
fragments of anti-TAT antibodies (see below) as long as they exhibit the desired biological or immunological 
activity. The term "immunoglobulin" (Ig) is used interchangeable with antibody herein. 

An "isolated antibody" is one which has been identified and separated and/or recovered from a 

1 5 component of its natural environment. Contaminant components of its natural environment are materials which 
would interfere with diagnostic or therapeutic uses for the antibody, and may include enzymes, hormones, and 
other proteinaceous or nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1) 
to greater than 95 % by weight of antibody as determined by the Lowry method, and most preferably more than 
99% by weight, (2) to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid 

20 sequence by use of a spinning cup sequenator, or (3) to homogeneity by SDS-PAGE under reducing or 
nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated antibody includes the 
antibody in situ within recombinant cells since at least one component of the antibody's natural environment will 
not be present. Ordinarily, however, isolated antibody will be prepared by at least one purification step. 

The basic 4-chain antibody unit is a heterotetrameric glycoprotein composed of two identical light (L) 

25 chains and two identical heavy (H) chains (an IgM antibody consists of 5 of the basic heterotetramer unit along 
with an additional polypeptide called J chain, and therefore contain 10 antigen binding sites, while secreted IgA 
antibodies can polymerize to form polyvalent assemblages comprising 2-5 of the basic 4-chain units along with 
J chain). In the case of IgGs, the 4-chain unit is generally about 150,000 daltons. Each L chain is linked to 
a H chain by one covalent disulfide bond, while the two H chains are linked to each other by one or more 

30 disulfide bonds depending on the H chain isotype. Each H and L chain also has regularly spaced intrachain 
disulfide bridges. Each H chain has at the N-terminus, a variable domain (V H ) followed by three constant 
domains (C H ) for each of the a and y chains and four C H domains for ^ and e isotypes. Each L chain has at the 
N-terminus, a variable domain (VJ followed by a constant domain (CJ at its other end. The V L is aligned with 
the V H and the C L is aligned with the first constant domain of the heavy chain (C H l). Particular amino acid 

35 residues are believed to form an interface between the light chain and heavy chain variable domains. The 
pairing of a V H and V L together forms a single antigen-binding site. For the structure and properties of the 
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different classes of antibodies, see, e.g., Basic and Clinical immunology. 8th edition, Daniel P. Stites, Abba 
I. Terr and Tristram G. Parslow (eds.), Appleton & Lange, Norwalk, CT, 1994, page 71 and Chapter 6. 

The L chain from any vertebrate species can be assigned to one of two clearly distinct types, called 
kappa and lambda, based on the amino acid sequences of their constant domains. Depending on the amino acid 
sequence of the constant domain of their heavy chains (C H ), immunoglobulins can be assigned to different 
5 classes or isotypes. There are five classes of immunoglobulins: IgA, IgD, IgE, IgG, and IgM, having heavy 
chains designated a, 6, e, y> and n, respectively. They and a classes are further divided into subclasses on the 
basis of relatively minor differences in C H sequence and function, e.g. , humans express the following subclasses: 
IgGl, IgG2, IgG3, IgG4, IgAl, and IgA2. 

The term "variable" refers to the fact that certain segments of the variable domains differ extensively 

10 in sequence among antibodies. The V domain mediates antigen binding and define specificity of a particular 
antibody for its particular antigen. However, the variability is not evenly distributed across the 1 10-amino acid 
span of the variable domains. Instead, the V regions consist of relatively invariant stretches called framework 
regions (FRs) of 15-30 amino acids separated by shorter regions of extreme variability called "hypervariable 
regions" that are each 9-12 amino acids long. The variable domains of native heavy and light chains each 

15 comprise four FRs, largely adopting a (5-sheet configuration, connected by three hypervariable regions, which 
form loops connecting, and in some cases forming part of, the p-sheet structure. The hypervariable regions in 
each chain are held together in close proximity by the FRs and, with the hypervariable regions from the other 
chain, contribute to the formation of the antigen-binding site of antibodies (see Kabat et al., Sequences of 
Proteins of Immunological Interest. 5th Ed. Public Health Service, National Institutes of Health, Bethesda, MD. 

20 (1991)). The constant domains are not involved directly in binding an antibody to an antigen, but exhibit various 
effector functions, such as participation of the antibody in antibody dependent cellular cytotoxicity (ADCC). 

The term "hypervariable region" when used herein refers to the amino acid residues of an antibody 
which are responsible for antigen-binding. The hypervariable region generally comprises amino acid residues 
from a "complementarity determining region" or "CDR" (e.g. around about residues 24-34 (LI), 50-56 (L2) 

25 and 89-97 (L3) in the V L , and around about 1-35 (HI), 50-65 (H2) and 95-102 (H3) in the V H ; Kabat et al., 
Sequences of Proteins of Immunological Interest. 5th Ed. Public Health Service, National Institutes of Health, 
Bethesda, MD. (1991)) and/or those residues from a "hypervariable loop" (e.g. residues 26-32 (LI), 50-52 (L2) 
and 91-96 (L3) in the V L , and 26-32 (HI), 53-55 (H2) and 96-101 (H3) in the V H ; Chothia and Lesk J. Mol. 
Biol. 196:901-917 (1987)). 

30 The term "monoclonal antibody" as used herein refers to an antibody obtained from a population of 

substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are identical 
except for possible naturally occurring mutations that may be present in minor amounts. Monoclonal antibodies 
are highly specific, being directed against a single antigenic site. Furthermore, in contrast to polyclonal 
antibody preparations which include different antibodies directed against different determinants (epitopes), each 

35 monoclonal antibody is directed against a single determinant on the antigen. In addition to their specificity, the 
monoclonal antibodies are advantageous in that they may be synthesized uncontaminated by other antibodies. 
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The modifier "monoclonal" is not to be construed as requiring production of the antibody by any particular 
method. For example, the monoclonal antibodies useful in me present invention may be prepared by the 
hybridoma methodology first described by Kohler et al., Nature. 256:495 (1975), or may be made using 
recombinant DNA methods in bacterial, eukaryotic animal or plant cells (see, e.g. , U.S. Patent No. 4,816,567). 
The "monoclonal antibodies" may also be isolated from phage antibody libraries using the techniques described 
in Clackson et al., Nature, 352:624-628 (1991) and Marks et al., J. Mol. Biol. , 222:581-597 (1991), for 
example. 

Themonoclonal antibodies herein include "chimeric" antibodies in which a portion of the heavy and/or 
light chain is identical with or homologous to corresponding sequences in antibodies derived from a particular 
species or belonging to a particular antibody class or subclass, while the remainder of the chain(s) is identical 
with or homologous to corresponding sequences in antibodies derived from another species or belonging to 
another antibody class or subclass, as well as fragments of such antibodies, so long as they exhibit the desired 
biological activity (see U.S. Patent No. 4,816,567; and Morrison et al., Proc. Nad. Acad. Sci. USA. 81:6851- 
6855 ( 1984)). Chimeric antibodies of interest herein include "primanzed" antibodies comprising variable domain 
antigen-binding sequences derived from a non-human primate (e.g. Old World Monkey, Ape etc), and human 
constant region sequences. 

An "intact" antibody is one which comprises an antigen-binding site as well as a C L and at least heavy 
chain constant domains, C H 1, C H 2 and C„3. The constant domains may be native sequence constant domains 
(e.g. human native sequence constant domains) or amino acid sequence variant thereof. Preferably, the intact 
antibody has one or more effector functions. 

"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or 
variable region of the intact antibody. Examples of antibody fragments include Fab, Fab', F(ab') 2 , and Fv 
fragments; diabodies; linear antibodies (see U.S. Patent No. 5,641,870, Example 2; Zapata et al., Protein Eng. 
8(10): 1057-1062 [1995]); single-chain antibody molecules; and multispecific antibodies formed from antibody 
fragments. 1 

Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" 
fragments, and a residual "Fc" fragment, a designation reflecting the ability to crystallize readily. The Fab 
fragment consists of an entire L chain along with the variable region domain of the H chain (V„), and the first 
constant domain of one heavy chain (C H 1). Each Fab fragment is monovalent with respect to antigen binding, 
i.e., it has a single antigen-binding site. Pepsin treatment of an antibody yields a single large F(ab^ fragment 
which roughly corresponds to two disulfide linked Fab fragments having divalent antigen-binding activity and 
is still capable of cross-linking antigen. Fab' fragments differ from Fab fragments by having additional few 
residues at the carboxy terminus of the C H 1 domain including one or more cysteines from the antibody hinge 
region. Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant domains bear 
a free thiol group. F(ab') ! antibody fragments originally were produced as pairs of Fab ' fragments which have 
hinge cysteines between them. Other chemical couplings of antibody fragments are also known. 

The Fc fragment comprises the carboxy-tenninal portions of both H chains held together by disulfides. 
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The effector functions of antibodies are determined by sequences in the Fc region, which region is also the part 
recognized by Fc receptors (FcR) found on certain types of cells. 

"Fv" is the minimum antibody fragment which contains a complete antigen-recognition and -binding 
site. This fragment consists of a dimer of one heavy- and one light-chain variable region domain in tight, non- 
covalent association. From the folding of these two domains emanate six hypervariable loops (3 loops each from 
5 the H and L chain) that contribute the amino acid residues for antigen binding and confer antigen binding 

specificity to the antibody. However, even a single variable domain (or half of an Fv comprising only three 
CDRs specific for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than the 
entire binding site. 

"Single-chain Fv" also abbreviated as "sFv n or w scFv" are antibody fragments that comprise the V H 
10 and V L antibody domains connected into a, single polypeptide chain. Preferably, the sFv polypeptide further 
comprises a polypeptide linker between the V H and V L domains which enables the sFv to form the desired 
structure for antigen binding. For a review of sFv, see Pluckthun in The Pharmacology of Monoclonal 
Antibodies, vol. 113, Rosenburg and Moore eds., Springer- Verlag, New York, pp. 269-315 (1994); Borrebaeck 
1995, infra. 

1 5 The term "diabodies" refers to small antibody fragments prepared by constructing sFv fragments (see 

preceding paragraph) with short linkers (about 5-10 residues) between the V H and V L domains such that inter- 
chain but not intra-chain pairing of the V domains is achieved, resulting in a bivalent fragment, i.e., fragment 
having two antigen-binding sites. Bispecific diabodies are heterodimers of two "crossover" sFv fragments in 
which the V H and V L domains of the two antibodies are present on different polypeptide chains. Diabodies are 

20 described more fully in, for example, EP 404,097; WO 93/11161; and Hollinger et al., Proc. Natl. Acad. Sci. 
USA, 90:6444-6448 (1993). 

"Humanized" forms of non-human {e.g., rodent) antibodies are chimeric antibodies that contain 
minimal sequence derived from the non-human antibody. For the most part, humanized antibodies are human 
immunoglobulins (recipient antibody) in which residues from a hypervariable region of the recipient are replaced 

25 by residues from a hypervariable region of a non-human species (donor antibody) such as mouse, rat, rabbit or 
non-human primate having the desired antibody specificity, affinity, and capability. In some instances, 
framework region (FR) residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Furthermore, humanized antibodies may comprise residues that are not found in the recipient antibody 
or in the donor antibody. These modifications are made to further refine antibody performance. In general, 

30 the humanized antibody will comprise substantially all of at least one, and typically two, variable domains, in 
which all or substantially all of the hypervariable loops correspond to those of a non-human immunoglobulin 
and all or substantially all of the FRs are those of a human immunoglobulin sequence. The humanized antibody 
optionally also will comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a 
human immunoglobulin. For further details, see Jones et al., Nature 321:522-525 (1986); Riechmann et al., 

35 Nature 332:323-329 (1988); and Presta, Curr. On. Struct. Biol. 2:593-596 (1992). 

A "species-dependent antibody," e.g., a mammalian anti-human IgE antibody, is an antibody which 
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has a stronger binding affinity for an antigen from a first mammalian species than it has for a homologue of that 
antigen from a second mammalian species. Normally, the species-dependent antibody "bind specifically" to 
a human antigen (i.e., has a binding affinity (Kd) value of no more than about 1 x 1(T 7 M, preferably no more 
than about 1 x 10 s and most preferably no more than about 1 x 10' 9 M) but has a binding affinity for a 
homologue of the antigen from a second non-human mammalian species which is at least about 50 fold, or at 
least about 500 fold, or at least about 1000 fold, weaker than its binding affini ty for the human antigen. The 
species-dependent antibody can be of any of the various types of antibodies as defined above, but preferably is 
a humanized or human antibody. 

A "TAT binding oligopeptide n is an oligopeptide that binds, preferably specifically, to a TAT 
polypeptide as described herein. TAT binding oligopeptides may be chemically synthesized using known 
oligopeptide synthesis methodology or may be prepared and purified using recombinant technology. TAT 
binding oligopeptides are usually at least about 5 amino acids in length, alternatively at least about 6, 7, 8, 9, 
10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 
38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 
66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 
94, 95, 96, 97, 98, 99, or 100 amino acids in length or more, wherein such oligopeptides that are capable of 
binding, preferably specifically, to a TAT polypeptide as described herein. TAT binding oligopeptides may be 
identified without undue experimentation using well known techniques. In this regard, it is noted that techniques 
for screening oligopeptide libraries for oligopeptides that are capable of specifically binding to a polypeptide 
target are well known in the art (see, e.g., U.S. Patent Nos. 5,556,762, 5,750,373, 4,708,871, 4,833,092, 
5,223,409, 5,403,484, 5,571,689, 5,663,143; PCT Publication Nos. WO 84/03506 and WO84/03564; Geysen 
et al., Proc. Natl. Acad. Sci. U.S.A., 81:3998-4002 (1984); Geysen et al., Proc. Natl. Acad. Sci. U.S.A., 
82:178-182 (1985); Geysen et al., in Synthetic Peptides as Antigens, 130-149 (1986); Geysen et al., J. 
Immunol. Meth., 102:259-274 (1987); Schoofs et al., J. Immunol., 140:611-616 (1988), Cwirla, S. E. et al. 
(1990) Proc. Nad. Acad. Sci. USA, 87:6378; Lowman, H.B. et al. (1991) Biochemistry, 30:10832; Clackson, 
T. et al. (1991) Nature, 352: 624; Marks, J. D. et al. (1991), J. Mol. Biol., 222:581; Kang, A.S. et al. (1991) 
Proc. Natl. Acad. Sci. USA, 88:8363, and Smith, G. P. (1991) Current Opin. Biotechnol., 2:668). 
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A "TAT binding organic motecule" is an organic molecule other than an oligopeptide or antibody as 
defined herein that binds, preferably specifically, to a TAT polypeptide as described herein. TAT binding 
organic molecules may be identified and chemically synthesized using known methodology (see, e.g., PCT 
Publication Nos. WO00/00823 and WO00/39585), TAT binding organic molecules are usually less than about 
2000 daltons in size, alternatively less than about 1500, 750, 500, 250 or 200 daltons in size, wherein such 
5 organic molecules that are capable of binding, preferably specifically, to a TAT polypeptide as described herein 
may be identified without undue experimentation using well known techniques. In this regard, it is noted that 
techniques for screening organic molecule libraries for molecules that are capable of binding to a polypeptide 
target are well known in the art (see, e.g., PCT Publication Nos. WO00/00823 and WO00/39585). 

An antibody, oligopeptide or other organic molecule "whichbinds" an antigen of interest, e.g. atumor- 
1 0 associated polypeptide antigen target, is one that binds the antigen with sufficient affinity such that the antibody, 
oligopeptide or other organic molecule is useful as a diagnostic and/or therapeutic agent in targeting a cell or 
tissue expressing the antigen, and does not significantly cross-react with other proteins. In such embodiments, 
the extent of binding of the antibody, oligopeptide or other organic molecule to a "non-target" protein will be 
less than about 10% of the binding of the antibody, oligopeptide or other organic molecule to its particular target 

1 5 protein as determined by fluorescence activated cell sorting (FACS) analysis or radioimmunoprecipitation (RIA) . 
With regard to the binding of an antibody, oligopeptide or other organic molecule to a target molecule, the term 
"specific binding" or "specifically binds to" or is "specific for" a particular polypeptide or an epitope on a 
particular polypeptide target means binding that is measurably different from a non-specific interaction. Specific 
binding can be measured, for example, by determining binding of a molecule compared to binding of a control 

20 molecule, which generally is a molecule of similar structure that does not have binding activity. For example, 
specific binding can be determined by competition with a control molecule that is similar to the target, for 
example, an excess of non-labeled target. In this case, specific binding is indicated if the binding of the labeled 
target to a probe is competitively inhibited by excess unlabeled target. The term "specific binding" or 
"specifically binds to" or is "specific for" a particular polypeptide or an epitope on a particular polypeptide 

25 target as used herein can be exhibited, for example, by a molecule having a Kd for the target of at least about 
10- 4 M, alternatively at least about 10" 5 M, alternatively at least about 10" 6 M, alternatively at least about 10 1 
M, alternatively at least about 10" 8 M, alternatively at least about 10" 9 M, alternatively at least about 10 10 M, 
alternatively at least about 10 n M, alternatively at least about ltf 12 M, or greater. In one embodiment, the term 
"specific binding" refers to binding where a molecule binds to a particular polypeptide or epitope on a particular 

30 polypeptide without substantially binding to any other polypeptide or polypeptide epitope. 

An antibody, oligopeptide or other organic molecule that "inhibits the growth of tumor cells expressing 
a TAT polypeptide" or a "growth inhibitory" antibody, oligopeptide or other organic molecule is one which 
results in measurable growth inhibition of cancer cells expressing or overexpressing the appropriate TAT 
polypeptide. The TAT polypeptide may be a transmembrane polypeptide expressed on the surface of a cancer 

35 cell or may be a polypeptide that is produced and secreted by a cancer cell. Preferred growth inhibitory anti- 
TAT antibodies, oligopeptides or organic molecules inhibit growth of TAT-expressing tumor cells by greater 
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than 20%, preferably from about 20% to about 50%, and even more preferably, by greater than 50% (e.g., 
from about 50% to about 100%) as compared to the appropriate control, the control typically being tumor cells 
not treated with the antibody, oligopeptide or other organic molecule being tested. In one embodiment, growth 
inhibition can be measured at an antibody concentration of about 0. 1 to 30 ng/ml or about 0.5 nM to 200 nM 
in cell culture, where the growth inhibition is determined 1-10 days after exposure of the tumor cells to the 
antibody. Growth inhibition of tumor cells in vivo can be determined in various ways such as is described in 
the Experimental Examples section below. The antibody is growth inhibitory in vivo if administration of the 
anti-TAT antibody at about 1 jig/kg to about 100 mg/kg body weight results in reduction in tumor size or tumor 
cell proliferation within about 5 days to 3 months from the first administration of the antibody, preferably within 
about 5 to 30 days. 

An antibody, oligopeptide or other organic molecule which "induces apoptosis" is one which induces 
programmed cell death as determined by binding of annexin V, fragmentation of DNA, cell shrinkage, dilation 
of endoplasmic reticulum, cell fragmentation, and/or formation of membrane vesicles (called apoptotic bodies). 
The cell is usually one which overexpresses a TAT polypeptide. Preferably the cell is a tumor cell, e.g., a 
prostate, breast, ovarian, stomach, endometrial, lung, kidney, colon, bladder cell. Various methods are 
available for evaluating the cellular events associated with apoptosis. For example, phosphatidyl serine (PS) 
translocation can be measured by annexin binding; DNA fragmentation can be evaluated through DNA 
laddering; and nuclear/chromatin condensation along with DNA fragmentation can be evaluated by any increase 
in hypodiploid cells. Preferably, the antibody, oligopeptide or other organic molecule which induces apoptosis 
is one which results in about 2 to 50 fold, preferably about 5 to 50 fold, and most preferably about 10 to 50 fold, 
induction of annexin binding relative to untreated cell in an annexin binding assay. 

Antibody "effector functions* refer to those biological activities attributable to the Fc region (a native 
sequence Fc region or amino acid sequence variant Fc region) of an antibody, and vary with the antibody 
isotype. Examples of antibody effector functions include: Clq binding and complement dependent cytotoxicity; 
Fc receptor binding; antibody-dependent cell-mediated cytotoxicity (ADCC); phagocytosis; down regulation of 
cell surface receptors (e.g., B cell receptor); and B cell activation. 

"Antibody-dependent cell-mediated cytotoxicity" or "ADCC" refers to a form of cytotoxicity in which 
secreted Ig bound onto Fc receptors (FcRs) present on certain cytotoxic cells (e.g., Natural Killer (NK) cells, 
neutrophils, and macrophages) enable these cytotoxic effector cells to bind specifically to an antigen-bearing 
target cell and subsequently kill the target cell with cytotoxins. The antibodies "arm" the cytotoxic cells and 
are absolutely required for such killing. The primary cells for mediating ADCC, NK cells, express FcyRIII 
only, whereas monocytes express FcyRI, FcyRH and FcyRIII. FcR expression on hematopoietic cells is 
summarized in Table 3 on page 464 of Ravetch and Kinet, Annu, Rev. Immunol. 9:457-92 (1991). To assess 
ADCC activity of a molecule of interest, an in vitro ADCC assay, such as that described in US Patent No. 
5,500,362 or 5,821,337 may be performed. Useful effector cells for such assays include peripheral blood 
mononuclear cells (PBMC) and Natural Killer (NK) cells. Alternatively, or additionally, ADCC activity of the 
molecule of interest may be assessed in vivo, e.g., in a animal model such as that disclosed in Clynes et al. 
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(USA) 95:652-656 (1998). 

"Fc receptor" or "FcR" describes a receptor that binds to the Fc region of an antibody. The preferred 
FcR is a native sequence human FcR. Moreover, a preferred FcR is one which binds an IgG antibody (a gamma 
receptor) and includes receptors of the FcyRI, FcyRII and FcyRHI subclasses, including allelic variants and 
alternatively spliced forms of these receptors. FcyRII receptors include FcyRIIA (an "activating receptor") and 
FcyRIIB (an "inhibiting receptor"), which have similar amino acid sequences that differ primarily in the 
cytoplasmic domains thereof. Activating receptor Fc yRIIA contains an immunoreceptor tyrosine-based 
activation motif (ITAM) in its cytoplasmic domain. Inhibiting receptor FcyRIIB contains an immunoreceptor 
tyrosine-based inhibition motif (ITIM) in its cytoplasmic domain, (see review M. in Daeron, Annu. Rev. 
faraunQ L 15:203-234 (1997)). FcRs are reviewed in Ravetch and Kinet, Annu. Rev. Immunol. 9:457-492 
(1991); Capel et al., Immunomethods 4:25-34 (1994); and de Haas et al., J. Lab. Clin. Med. 126:330-41 
(1995). Other FcRs, including those to be identified in the future, are encompassed by the term "FcR" herein. 
The term also includes the neonatal receptor, FcRn, which is responsible for the transfer of maternal IgGs to 
the fetus (Guyer et al., J. Immunol. 117:587 (1976) and Kim et al., J. Immunol. 24:249 (1994)). 

"Human effector cells " are leukocytes which express one or more FcRs and perform effector functions . 
Preferably, the cells express at least Fc yRm and perform ADCC effector function. Examples of human 
leukocytes which mediate ADCC include peripheral blood mononuclear cells (PBMC), natural killer (NK) cells, 
monocytes, cytotoxic T cells and neutrophils; with PBMCs and NK cells being preferred. The effector cells may 
be isolated from a native source, e.g., from blood. 

"Complement dependent cytotoxicity " or "CDC" refers to the lysis of a target ceil in the presence of 
complement. Activation of the classical complement pathway is initiated by the binding of the first component 
of the complement system (Clq) to antibodies (of the appropriate subclass) which are bound to their cognate 
antigen. To assess complement activation, a CDC assay, e.g., as described in Gazzano-Santoro et al., L 
Immunol. Methods 202:163 (1996), may be performed. 

The terms "cancer" and "cancerous" refer to or describe the physiological condition in mammals that 
is typically characterized by unregulated cell growth. Examples of cancer include, but are not limited to, 
carcinoma, lymphoma, blastoma, sarcoma, and leukemia or lymphoid malignancies. More particular examples 
of such cancers include squamous cell cancer (e.g., epithelial squamous cell cancer), lung cancer including 
small-cell lung cancer, non-small cell lung cancer, adenocarcinoma of the lung and squamous carcinoma of the 
lung, cancer of the peritoneum, hepatocellular cancer, gastric or stomach cancer including gastrointestinal 
cancer, pancreatic cancer, glioblastoma, cervical cancer, ovarian cancer, liver cancer, bladder cancer, cancer 
of the urinary tract, hepatoma, breast cancer, colon cancer, rectal cancer, colorectal cancer, endometrial or 
uterine carcinoma, salivary gland carcinoma, kidney or renal cancer, prostate cancer, vulval cancer, thyroid 
cancer, hepatic carcinoma, anal carcinoma, penile carcinoma, melanoma, multiple myeloma and B-cell 
lymphoma, brain, as well as head and neck cancer, and associated metastases. 

The terms "cell proliferative disorder" and "proliferative disorder" refer to disorders that are 
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associated with some degree of abnormal cell proliferation. In one embodiment, the cell proliferative disorder 
is cancer. 

"Tumor", as used herein, refers to all neoplastic cell growth and proliferation, whether malignant or 
benign, and all pre-cancerous and cancerous cells and tissues. 

An antibody, oligopeptide or other organic molecule which "induces cell death" is one which causes 
5 a viable cell to become nonviable. The cell is one which expresses a TAT polypeptide, preferably a cell that 
overexpresses a TAT polypeptide as compared to a normal cell of the same tissue type. The TAT polypeptide 
may be a transmembrane polypeptide expressed on the surface of a cancer cell or may be a polypeptide that is 
produced and secreted by a cancer cell. Preferably, the cell is a cancer cell, e.g., a breast, ovarian, stomach, 
endometrial, salivary gland, lung, kidney, colon, thyroid, pancreatic or bladder cell. Cell death in vitro may 

10 be determined in the absence of complement and immune effector cells to distinguish cell death induced by 
antibody-dependent cell-mediated cytotoxicity (ADCC) or complement dependent cytotoxicity (CDC). Thus, 
the assay for cell death may be performed using heat inactivated serum (i.e. , in the absence of complement) and 
in the absence of immune effector cells. To determine whether the antibody, oligopeptide or other organic 
molecule is able to induce cell death, loss of membrane integrity as evaluated by uptake of propidium iodide 

15 (PI), trypan blue (see Moore et al. Cvtotechnology 17:1-11 (1995)) or 7AAD can be assessed relative to 
untreated cells. Preferred cell death-inducing antibodies, oligopeptides or other organic molecules are those 
which induce PI uptake in the PI uptake assay in BT474 cells. 

A "TAT-expressing celT is a cell which expresses an endogenous or transfected TAT polypeptide 
either on the cell surface or in a secreted form. A "TAT-expressing cancer" is a cancer comprising cells that 

20 have a TAT polypeptide present on the cell surface or that produce and secrete a TAT polypeptide. A "TAT- 
expressing cancer" optionally produces sufficient levels of TAT polypeptide on the surface of cells thereof, such 
that an anti-TAT antibody, oligopeptide ot other organic molecule can bind thereto and have a therapeutic effect 
with respect to the cancer. In another embodiment, a "TAT-expressing cancer" optionally produces and 
secretes sufficient levels of TAT polypeptide, such that an anti-TAT antibody, oligopeptide ot other organic 

25 molecule antagonist can bind thereto and have a therapeutic effect with respect to the cancer. With regard to 
the latter, the antagonist may be an antisense oligonucleotide which reduces, inhibits or prevents production and 
secretion of the secreted TAT polypeptide by tumor cells. A cancer which "overexpresses" a TAT polypeptide 
is one which has significantly higher levels of TAT polypeptide at the cell surface thereof, or produces and 
secretes, compared to a noncancerous cell of the same tissue type. Such overexpression may be caused by gene 

30 amplification or by increased transcription or translation. TAT polypeptide overexpression may be determined 
in a diagnostic or prognostic assay by evaluating increased levels of the TAT protein present on the surface of 
a cell, or secreted by the cell (e.g., via an immunohistochemistry assay using anti-TAT antibodies prepared 
against an isolated TAT polypeptide which may be prepared using recombinant DNA technology from an 
isolated nucleic acid encoding the TAT polypeptide; FACS analysis, etc.). Alternatively, or additionally, one 

3 5 may measure levels of TAT polypeptide-encoding nucleic acid or mRNA in the cell, e.g. , via fluorescent in situ 
hybridization using a nucleic acid based probe corresponding to a TAT-encoding nucleic acid or the complement 
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thereof; (FISH; see W098/45479 published October, 1998), Southern blotting, Northern blotting, or polymerase 
chain reaction (PCR) techniques, such as real time quantitative PCR (RT-PCR). One may also study TAT 
polypeptide overexpression by measuring shed antigen in a biological fluid such as serum, e.g, using antibody- 
based assays (see also, e.g., U.S. Patent No. 4,933,294 issued June 12, 1990; WO91/05264 published April 
18, 1991; U.S. Patent 5,401,638 issued March 28, 1995; and Sias et al., J. Immunol. Martini 132:73-80 
(1990)). Aside from the above assays, various in vivo assays are available to the skilled practitioner. For 
example, one may expose cells within the body of the patient to an antibody which is optionally labeled with 
a detectable label, e.g. , a radioactive isotope, and binding of the antibody to cells in the patient can be evaluated, 
e.g., by external scanning for radioactivity or by analyzing a biopsy taken from a patient previously exposed 
to the antibody. 

As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the 
binding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin 
constant domains. Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the 
desired binding specificity which is other than the antigen recognition and binding site of an antibody (i.e., is 
"heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin 
molecule typically is a contiguous amino acid sequence comprising at least the binding site of a receptor or a 
ligand. The immunoglobulin constant domain sequence in the immunoadhesin may be obtained from any 
immunoglobulin, such as IgG-1, IgG-2, IgG-3, or IgG-4 subtypes, IgA (including IgA-1 and IgA-2), IgE, IgD 
or IgM. 

The word "label" when used herein refers to a detectable compound or composition which is conjugated 
directly or indirectly to the antibody, oligopeptide or other organic molecule so as to generate a "labeled" 
antibody, oligopeptide or other organic molecule. The label may be detectable by itself (e.g. radioisotope labels 
or fluorescent labels) or, in the case of an enzymatic label, may catalyze chemical alteration of a substrate 
compound or composition which is detectable. 

The term "cytotoxic agent" as used herein refers to a substance that inhibits or prevents the function 
of cells and/or causes destruction of cells. The term is intended to include radioactive isotopes (e.g., I 131 , 
ji25^ y 90 , Re 186 , Re 188 , Sm 153 , Bi 212 , P 32 and radioactive isotopes of Lu), chemotherapeutic agents e.g. 
methotrexate, adriamicin, vinca alkaloids (vincristine, vinblastine, etoposide), doxorubicin, melphalan, 
mitomycin C, chlorambucil, daunorubicin or other intercalating agents, enzymes and fragments thereof such 
as nucleolytic enzymes, antibiotics, and toxins such as small molecule toxins or enzymatically active toxins of 
bacterial, fungal, plant or animal origin, including fragments and/or variants thereof, and the various antitumor 
or anticancer agents disclosed below. Other cytotoxic agents are described below. A tumoricidal agent causes 
destruction of tumor cells. 

A "growth inhibitory agent" when used herein refers to a compound or composition which inhibits 
growth of a cell, especially a TAT-expressing cancer cell, either in vitro or in vivo. Thus, the growth inhibitory 
agent may be one which significantly reduces the percentage of TAT-expressing cells in S phase. Examples of 
growth inhibitory agents include agents that block cell cycle progression (at a place other than S phase), such 
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as agents that induce Gl arrest and M-phase arrest. Classical M-phase blockers include the vincas (vincristine 
and vinblastine), taxanes, and topoisomerase II inhibitors such as doxorubicin, epirubicin, daunorubicin, 
etoposide, and bleomycin. Those agents that arrest Gl also spill over into S-phase arrest, for example, DNA 
alkylating agents such as tamoxifen, prednisone, dacarbazine, mechlorethamine, cisplatin, methotrexate, 5- 
fluorouracil, and ara-C. Further information can be found i rfThe Molecular Basis of Omr^r , Mendelsohn and 
Israel, eds., Chapter 1, entitled "Cell cycle regulation, oncogenes, and antineoplastic drugs" by Murakami et 
al. (WB Saunders: Philadelphia, 1995), especially p. 13. The taxanes (paclitaxel and docetaxel) are anticancer 
drugs both derived from the yew tree. Docetaxel (TAXOTERE®, Rhone-Poulenc Rorer), derived from the 
European yew, is a semisynthetic analogue of paclitaxel (TAXOL®, Bristol-Myers Squibb). Paclitaxel and 
docetaxel promote the assembly of microtubules from tubulin dimers and stabilize microtubules by preventing 
depolymerization, which results in the inhibition of mitosis in cells. 

"Doxorubicin" is an anthracycline antibiotic. The full chemical name of doxorubicin is (8S-cis)-10-[(3- 
ammo-2,3 ,6-trideoxy«-I^lyxo-hexapyranosyl)oxy]-7,8,9, 10-tetrahydro-6,8, 1 l-trihydroxy-8-<hydroxyacetyl)-l- 
methoxy-5, 12-naphthacenedione. 

The term "cytokine" is a generic term for proteins released by one cell population which act on another 
cell as intercellular mediators. Examples of such cytokines are lymphokines, monokines, and traditional 
polypeptide hormones. Included among the cytokines are growth hormone such as human growth hormone, N- 
methionyl human growth hormone, and bovine growth hormone; parathyroid hormone; thyroxine; insulin; 
proinsulin; relaxin; prorelaxin; glycoprotein hormones such as follicle stimulating hormone (FSH), thyroid 
stimulating hormone (TSH), and luteinizing hormone (LH); hepatic growth factor; fibroblast growth factor; 
prolactin; placental lactogen; tumor necrosis factor-a and -p; muUerian-inhibiting substance; mouse 
gonadotropin-associated peptide; inhibin; activin; vascular endothelial growth factor; integrin; thrombopoietin 
(TPO); nerve growth factors such as NGF-p; platelet-growth factor; transforming growth factors (TGFs) such 
as TGF-a and TGF- P; insulin-like growth factor-I and -II; erythropoietin (EPO); osteoinductive factors; 
interferons such as interferon -a, -p, and -y; colony stimulating factors (CSFs) such as macrophage-CSF (M- 
CSF); granulocyte-macrophage-CSF(GM-CSF); andgranulocyte-CSF (G-CSF); interleukins (ILs) such as IL-1, 
IL- la, IL-2, 11^3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-11, IL-12; a tumor necrosis factor such as TNF-a 
or TNF-B; and other polypeptide factors including LIF and kit ligand (KL). As used herein, the term cytokine 
includes proteins from natural sources or from recombinant cell culture and biologically active equivalents of 
the native sequence cytokines. 
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The term "package insert" is used to refer to instructions customarily included in commercial packages 
of therapeutic products, that contain information about the indications, usage, dosage, administration, 
contraindications and/or warnings concerning the use of such therapeutic products. 
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Table 1 

/* 
* 

* G-C increased from 12 to 15 

* Z is average of EQ 
*B is average of ND 

* match with stop is _M; stop-stop = 0; J (joker) match = 0 
*/ 

^define _M -8 /* value of a match with a stop */ 

iot day[26][26] = { 

/* a'bcdefghijklmnopqrstuvwxyz*/ 

/* A */ { 2, 0,-2, 0, 0,-4, 1,-1,-1, 0,-1,-2,-1, 0,_M, 1, 0,-2, 1, 1, 0, 0,-6, 0,-3, 0}, 

/* B */ { 0, 3,-4, 3, 2,-5, 0, 1,-2, 0, 0,-3,-2, 2 f _M,-l, 1, 0, 0, 0, 0,-2,-5, 0,-3, 1}, 

/* C */ {-2,-4,15,-5,-5,-4,-3,-3,-2, 0,-5,-6,-5,-4,JVI,-3,-5,A 0,-2, 0,-2,-8, 0, 0,-5}, 

/* D */ { 0, 3,-5, 4, 3,-6, 1, 1,-2, 0, 0,-4,-3, 2,_M,-1, 2,-1, 0, 0, 0,-2,-7, 0,-4, 2}, 

/* E */ { 0, 2,-5, 3, 4,-5, 0, 1,-2, 0, 0,-3,-2, 1..M.-1. 2,-1, 0, 0, 0,-2,-7, 0,-4, 3}, 

/* F */ {-4,-5,-4,-6,-5, 9,-5,-2, 1, 0,-5, 2, 0,-4,JM,-5,-5,-4,-3,-3, 0,-1, 0, 0, 7,-5}, 

/* G */ { 1, 0,-3, 1, 0,-5, 5,-2,-3, 0,-2,-4,-3, 0,_M,-l,-l,-3, 1, 0, 0,-1,-7, 0,-5, 0}, 

/* H */ {-1, 1,-3, 1, 1,-2,-2, 6,-2, 0, 0,-2,-2, 2,_M, 0, 3, 2,-1,-1, 0,-2,-3, 0, 0, 2}, 

/* i */ {-1,-2,-2,-2,-2, 1,-3,-2, 5, 0,-2, 2, 2,-2,_M,-2,-2,-2,-l, 0, 0, 4,-5, 0,-1,-2}, 

/* J */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 

/* K */ {-1, 0,-5, 0, 0,-5,-2, 0,-2, 0, 5,-3, 0, 1,_M,-1, 1, 3, 0, 0, 0,-2,-3, 0,-4, 0}, 

/* L */ {-2,-3,-6,-4,-3, 2,-4,-2, 2, 0,-3, 6, 4,-3, _M,-3, -2,-3,-3,-1, 0, 2,-2, 0,-1,-2}, 

/* M */ {-1,-2,-5,-3,-2, 0,-3,-2, 2, 0, 0, 4, 6,-2,_M,-2,-l, 0,-2,-1, 0, 2,-4, 0,-2,-1}, 

/* N */ { 0, 2,-4, 2, 1,-4, 0, 2,-2, 0, 1,-3,-2, 2,_M,-1, 1, 0, 1, 0, 0,-2,-4, 0,-2, 1}, 

/*0*/ {_M, M,_M,_M,JM,JM, M,_M,_M, M,_M,_M,_M,_M, 0,_M,_M f _M,_M,. 

/* P */ { 1,-1,-3,-1,-1,-5,-1, 0,-2rO,-l,-3,-2,-T,_M, 6, 0, 0, 1, 0, 0,-1,-6, 0,-5, 0}, 

/* Q */ { 0, 1,-5, 2, 2,-5,-1, 3,-2, 0, 1,-2,-1, 1,_M, 0, 4, 1,-1,-1, 0,-2,-5, 0,-4, 3}, 

/* R */ {-2, 0,-4,-1,-1,-4,-3, 2,-2, 0, 3,-3, 0, 0,_M, 0, 1, 6, 0,-1, 0,-2, 2, 0,-4, 0}, 

/* S */ { 1, 0, 0, 0, 0,-3, 1,-1,-1, 0, 0,-3,-2, 1,_M, 1,-1, 0, 2, 1, 0,-1,-2, 0,-3, 0}, 
/* T */ { 1, 0,-2, 0, 0,-3, 0,-1, 0, 0, 0,-1,-1, 0,_M, 0,-1,-1, 1, 3, 0, 0,-5, 0,-3, 0}, 
/* U */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 
/* V */ { 0,-2,-2,-2,-2,-1,-1,-2, 4, 0,-2, 2, 2,-2,_M,- 1,-2,-2,-1, 0, 0, 4,-6, 0,-2,-2}, 
/♦ W */ {-6,-5,-8,-7,-7, 0,-7,-3,-5, 0,-3,-2,-4,-4,_M,-6,-5, 2,-2,-5, 0,-6,17, 0, 0,-6}, 
/* X */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 
/* Y */ {-3,-3, 0,-4,-4, 7,-5, 0,-1, 0,-4,-l,-2,-2,JVI,-5,-4,-4,-3,-3, 0,-2, 0, 0,10,-4}, 
/* Z */ { 0, 1,-5, 2, 3,-5, 0, 2,-2, 0, 0,-2,-1, 1,_M, 0, 3, 0, 0, 0, 0,-2,-6, 0,-4, 4} 

}; 
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Table 1 (conf) 

/* 
*/ 

^include <stdio.h> 
^include <ctype.h> 



#define MAXJMP 


16 


/* max jumps in a diag */ 




^define MAXGAP 


24 


/* don't continue to penalize gai 


is larger than this */ 


^define JMPS 


1024 


/* max jmps in an path */ 


^define MX 


4 


/* save if there's at least MX-1 


bases since last jmp */ 


#define DMAT 


3 


/* value of matching bases */ 




^define DMIS 


0 


/* penalty for mismatched bases 


*/ 


^define DINSO 


8 


/* penalty for a gap */ 




#define DINS1 


1 


/* penalty per base */ 




#define PINSO 


8 


/* penalty for a gap */ 




#define PINS1 


4 


/* penalty per residue */ 




struct jmp { 









}; 



short 

unsigned short 



struct diag { 
int 
long 
short 
struct jmp 

}; 



n[MAXJMP]; /* size of jmp (neg for dely) */ 
x[MAXJMP]; /* base no. of jmp in seq x */ 
/♦limits seq to 2* 16-1 */ 



score; /* score at last jmp */ 

offset; /* offset of prev block */ 

ijmp; /* current jmp index */ 

jp; /* list of jmps */ 



struct path { 

S int spc; /* number of leading spaces */ 
short n[JMPS];/*sizeofjmp(gap)*/ 
int x[JMPS]; /* loc of jmp (last elem before gap) */ 

}» 

char *ofile; /* output file name */ 

char *namex[2]; /* seq names: getseqsQ */ 

char *prog; /* prog name for err msgs */ 

char *seqx[2]; /* seqs: getseqsO */ 

int dmax; /* best diag: nwO */ 

int dmaxO; /* final diag */ 

int dna; /* set if dna: mainO */ 

int endgaps; /* set if penalizing end gaps */ 

int gapx, gapy; /* total gaps in seqs */ 

int lenO, lenl; /* seq lens */ 

|nt ngapx, ngapy ; /* total size of gaps */ 

int smax; /* max score: nwO */ 

int *xbm; /* bitmap for matching */ 

long offset; /* current offset in jmp file */ 

struct diag *dx; /* holds diagonals */ 

struct path pp[2]; /* holds path for seqs */ 

char *calloc0, *mallocO, *index(), *strcpy0; 

char *getseq(), *g_calloc(); 
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10 



20 



Table 1 (conf) 

/* Needleman-Wunsch alignment program 
* 

* usage: progs filel £ile2 

* where filel and file2 are two dna or two protein sequences. 

* The sequences can be in upper- or lower-case an may contain ambiguity 

* Any lines beginning with * > 1 or ' < * are ignored 

* Max file length is 65535 (limited by unsigned short x in the jmp struct) 

* A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA 

* Output is in the file "align.out" 



* The program may create a tmp file in /tmp to hold info about traceback. 

* Original version developed under BSD 4.3 on a vax 8650 
*/ 

^include "nw.h" 
15 #include "day.h" 



static _dbval[26] = { 

1 , 14,2, 13,0,0,4, 1 1 ,0,0, 12,0,3, 15,0,0,0,5,6,8,8,7,9,0, 10,0 

>; 



static _pbval[26] = { 

1,2|(1<<(TT-W))|(1<<('N^A')^ 4, 8, 16, 32, 64, 
128, 256, OxFFFFFFF, 1<<10, 1<<11, 1 < < 12, 1<<13, 1<<14, 
1<<15, 1<<16, 1<<17, 1<<18, 1<<19, 1< <20, 1< <21, 1<<22, 
25 1<<23, 1<<24, 1<<25|(1<<( , E , - , A , ))|(K<( , Q , - , A')) 

}; 

main(ac, av) main 
int ac; 
30 char *avQ; 

{ 

prog = av[0]; 
if(ac!=3){ 

f]printf(stderr, "usage: %s filel file2\n", prog); 
3 5 fprintf(stderr, "where filel and file2 are two dna or two protein sequences An"); 

rprintf(stderr,"The sequences can be in upper- or lower-case\n M ); 

r^rintf(stderr,"Any lines beginning with ';* or 1 < ' are ignored\n"); 

Q)rintf(stderr, "Output is in the file \ "align. out\"\n"); 

exit(l); 

40 } 

namex[0] = av[l]; 

namex[l] = av[2]; 

seqx[0] = getseq(namex[0], &len0); 

seqx[l] = getseq(namex[l], &lenl); 
45 xbm = (dna)? jibval : j>bval; 

endgaps = 0; /* 1 to penalize endgaps */ 

ofile = "align.out"; /* output file */ 

50 nwO; /* fill in the matrix, get the possible jmps */ 

readjmpsO; /* get the actual jmps */ 

printO; /* print stats, alignment */ 

cleanup(0); /* unlink any tmp files */} 
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Table 1 (conf) 

/* do the alignment, return best score: mainO 

* dna: values in Fitch and Smith, PNAS, 80, 1382-1386, 1983 

* pro: PAM 250 values 

* When scores are equal, we prefer mismatches to any gap, prefer 
5 * a new gap to extending an ongoing gap, and prefer a gap in seqx 

* to a gap in seq y. 
*/ 

nwO nw 

{ 

1 0 char *px, *py; /* seqs and ptrs */ 

int *ndely, *dely; /* keep track of dely */ 

int ndelx, delx; /* keep track of delx */ 

int *tmp; /* for swapping rowO, rowl */ 

int mis; /* score for each type */ 

15 hit insO, insl; /* insertion penalties */ 

register id; /* diagonal index */ 

register ij; /*jmp index*/ 

register *col0, toll; /* score for curr, last row */ 

register xx, yy; /* index into seqs */ 

20 

dx = (struct diag *)g_calloc("to get diags", len0+lenl + l, sizeof(struct diag)); 
ndely - (int *)g_calloc("to get ndely", lenl + 1, sizeof(int)); 
dely = (int *)g^_calloc( n to get dely", lenl + 1, sizeof(int)); 
colO = (int *)g_calloc( n to get colO", lenl + 1, sizeof(int)); 
25 coll = (int *)g_calloc( n to get coll lenl + 1, sizeof(int)); 

insO = (dna)? DINS0 : PINS0; 
insl = (dna)? DINS1 : PINS1; 
smax = -10000; 
if (endgaps) { 

30 for (col0[0] = dely[0] = -insO, yy = 1; yy < = lenl; yy++) { 

col0[yy] = delyfyy] = coI0[yy-l] - insl; 
ndely[yy] = yy; 

} 

col0[0] = 0; /* Waterman Bull Math Biol 84 */ 

35 } 



for (yy = 1; yy <= lenl; yy++) 
dely[yy] — -insO; 
/* fill in match matrix 
40 */ 

for (px = seqx[0], xx = 1; xx < = ienO; px+ + , xx++) { 
/* initialize first entry in col 
*/ 

if (endgaps) { 
45 if (xx == 1) 

coll[0] = delx = -(insO+insl); 

else 

coil[0] = delx = col0[0] - insl; ' 
ndelx — xx; 



50 } 

else { 



55 



coll[0] = 0; 
delx = -insO; 
ndelx = 0; 
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Table 1 (contn 

...nw 

for (py = seqx[l], yy = 1; yy <= leal; py+ + , yy++) { 
mis = col0[yy-l]; 
if(dna) 

mis + = (xbm[^x-W]<S^bm[*py-'A'])? DMAT : DMIS; 
mis + = _day[*px-' A'JP'py-'A'); 

/* update penalty for del in x seq; 

* favor new del over ongong del 

* ignore MAXGAP if weighting endgaps 
*/ 

if (endgaps 1 1 ndelyfyy] < MAXGAP) { 

if (colOfyy] - insO > = dely[yy]) { 

dely[yy] = col0[yy] - (insO+insl); 
ndely[yy] = 1; 

} else { 

dely[yy] — insl; 
ndely[yy]++; 

} 

} else { 

if (colO[yy] - (insO+insl) > = dely[yy]) { 
dely[yy] = col0[yy] - (insO+insl); 
ndely[yy] = 1; 

} else 

ndeiy[yy] + + ; 

} 

/* update penalty for del in y seq; 

* favor new del over ongong del 
*/ 

if (endgaps 1 1 ndelx < MAXGAP) { 

if (coll[yy-l] - insO >= delx) { 

delx = coll[yy-l] - (insO+insl); 
ndelx = 1; 

} else { 

delx -= insl; 
ndelx++; 

> 

} else { 

if (coll[yy-l] - (insO+insl) > = delx) { 
delx = coil[yy-l] - (insO+insl); 
ndelx = 1; 

} else 

ndelx++; 

> 

/* pick the maximum score; we're favoring 

* mis over any del and delx over dely 
*/ 



...nw 

id = xx - yy + lenl - 1 ; 
if (mis > = delx && mis > = dely[yy]) 
coll[yy] = mis; 
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else if (debt > = dely[yy]) { 
coil[yy] = debt; 
ij « dx[id].ijmp; 

if (dxpd] jp.n[0] && (!dna 1 1 (ndelx > = MAXJMP 
5 && xx > dx[id].jp.x[ij]+MX) 1 1 mis > dx[id]. score +DESFS0)) { 

dx[id].ijmp-*-+; 
if(++ij >= MAXJMP) { 
writejmps(id); 
ij « dx[id].ijmp = 0; 

10 dx[id] .offset « offset; 

offset -f = sizeof(struct jmp) + sireof(ofrset); 

} 

} 

dx[id].jp.n[ij] = ndelx; 
15 dx[id].jp.x[ij] « xx; 

dxfid]. score = delx; 

else { 

coll[yy] = dely[yy]; 
20 ij = dx{id].ijmp; 

if (dx[id].jp.n[0] && (!dna 1 1 (ndelyfyy] > = MAXJMP 

&& xx > dxpd] jp.x[ij]+MX) 1 1 mis > dx[id].score+DINS0)) { 
dx[id].ijmp+ + ; 
if (++ij >= MAXJMP) { 
25 writejmps(id); 

ij = dxpdj.ijmp = 0; 
dx[id].ofifeet - offset; 

offset += sizeof(struct jmp) + sizeof(offset); 

} 

30 } 

dx[id].jp.n[ij] « -ndely[yy]; 
dx[id].jp.x[ij] = xx; 
dx[id].score — dely[yy]; 

35 if (xx lenO && yy < lenl) { 

/* last col 
*/ 

if (endgaps) 

coll[yy] -= insO+insl*(lenl-yy); 
40 if (colllyy] > smax) { 

smax = coll[yy]; 
dmax = id; 

> 

> 

45 > 

if (eadgaps && xx < lenO) 

coll[yy-l] -= ins0+insl*(len0-xx); 
if (coIl[yy-l) > smax) { 

smax = coll[yy-l]; 
50 dmax = id; 

} 

tmp — colO; colO = coll; coll = trap; } 
(void) free((char *)ndely); 
(void) tree((char *)dely); 
55 (void) free((char *)co!0); 

{void) free((char *)coll); } 
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/* 

* printO ~ only routine visible outside this module 
* 

5 * static: 

* getmatO — trace back best path, count matches: printO 

* pr_alignO — print alignment of described in array p0: printO 

* dumpblockO — dump a block of lines with numbers, stars: pr_alignO 

* numsO — put out a number line: dumpblockO 

10 * putlineO - put out a line (name, [num], seq, [num]): dumpblockO 

* starsO - -put a line of stars: dumpblockO 

* stripnameO — strip any path and prefix from a seqname 
*/ 

15 ^include °nw.h w 

^define SPC 3 

^define P_LINE 256 /* maximum output line */ 
^define P_SPC 3 /* space between name or num and seq */ 

extern _day[26][26]; 

int olen; /* set output line length */ 

FILE *fx; /* output file */ 



20 



25 printO print 

{ 

int lx, ly, flrstgap, lastgap; /* overlap */ 

if ((fx » fopen(oflle, "w")) == 0) { 
30 fprintf(stderr, w %s: can't write %s\n", prog, ofile); 

cleanup(l); 

} 

fprintf(rx, " <first sequence: %s (length = %d)\n", namex[0], lenO); 
ffc>rintf(fx, "<second sequence: %s (length = %d)\n", namex[l], lenl); 
35 olen = 60; 

lx = lenO; 
ly = lenl; 

flrstgap = lastgap = 0; 

if (dmax < lenl - 1) { /* leading gap in x */ 
40 pp[0].spc = flrstgap = lenl - dmax - 1; 

ly-= pp[0].spc; 

> 

else if (dmax > lenl - 1) { /* leading gap in y */ 
pp[l]«spc = flrstgap = dmax - (lenl - 1); 
45 lx-=pp[l].spc; 
} 

if (dmaxO < lenO - 1) { /* trailing gap in x */ 
lastgap = lenO - dmaxO -1; 
lx-= lastgap; 

50 } 

else if (dmaxO > lenO - 1) { /* trailing gap in y */ 
lastgap = dmaxO - QenO - 1); 
ly -= lastgap; 

55 getmat(lx, ly, flrstgap, lastgap); 

pr_alignQ; } 
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/* 

* trace back the best path, count matches 
*/ 

static 

getmat(lx, ly, firstgap, lastgap) getmat 
int lx, ly; /* "core" (ininus endgaps) */ 

int firstgap, lastgap; /* leading trailing overlap */ 

int nm, iO, il, sizO, sizl; 

char outx[32]; 
double pet; 
register nO, nl; 

register char *p0, *pl; 
/* get total matches, score 
*/ 

iO = il = sizO = sizl — 0; 
pO = seqx[0] + pp[l).spc; 
pi = seqx[l] + pp[0].spc; 
nO = pp[l].spc + 1; 
nl = pp[0].spc + 1; 
nm = 0; 

while ( *p0 && *pl ) { 
if (sizO) { 

pl++; 

nl + + ; 
sizO—; 

} 

else if (sizl) { 

p0++; 
n0++; 
sizl-; 



else { 



if (xbmPpO-'A'l&xbmt^l-'A']) 

nm+ + ; 
if(nO++ ==pp[0].x[i0]) 

sizO = pp[0].n[iO++]; 
if(nl + + ==pp[l].x[il]) 

sizl = pp[l].n[il + +]; 

P0+ + ; 
pl + +; 



> 



/* pet homology: 

* if penalizing endgaps, base is the shorter seq 

* else, knock off overhangs and take shorter core 
*/ 

if (endgaps) 

lx = (lenO < lenl)? lenO : leal; 



lx = (lx < ly)? lx : ly; 
pet = 100.*(double)nm/(double)lx; 
fprintf(rx, "\n"); 

$>rintf(fx, " < %d match%s in an overlap of %d: %.2f percent similarity\n", 
mn, (nm == 1)? nn : °es\ lx, pet); 
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rprintf(rx, n < gaps in first sequence: %d" , gapx); . . • getmat 

if(gapx){ 

(void) sprintf(outx, " (%d %s%s)", 

ngapx, (dna)? "base": "residue", (ngapx == 1)? "Vs"); 

rprintf(rx,"%s", outx); 
rprintf(fx, gaps in second sequence: %d\ gapy); 
if(gapy){ 

(void) sprintf(outx, n (%d %s%s)", 

ngapy, (dna)? "base": "residue", (ngapy == 1)? "":"s"); 

fprintf(rx,"%s", outx); 

> 

if (dna) 

fprintf(tx, 

"\n< score: %& (match = %d, mismatch = %d, gap penalty = %d + %d per base)Vn", 
smax, DMAT, D3VOS, DINSO, DINS1); 

else 

rprintf(rx, 

"\n<score: %d (Dayhoff PAM 250 matrix, gap penalty = %d + %d per residue)\n", 
smax, PINSO, PINS1); 
if (endgaps) 

fprintf(fx, 

"<endgaps penalized, left endgap: %d %s%s, right endgap: %d %s%s\n", 
firstgap, (dna)? "base" : "residue", (firstgap == 1)? "" : "s", 
lastgap, (dna)? "base" : "residue", (lastgap == 1)? "" : "s"); 

rprintf(rx, "< endgaps not penalized\n"); 



} 

static nm; 
static lmax; 
static ij[2]; 
static nc[2]; 
static ni[2]; 
static siz[2]; 
static char *ps[2]; 
static char *po[2]; 
static char out[2]|P_LINE]; 
static char star [P JLINE] ; 

/* 

* print alignment of described in struct path ppQ 
*/ 

static 

pr_align0 

{ 

int nn; /* char count */ 

int more; 
register i; 

for (i = 0, lmax = 0; i < 2; i++) { 
nn = stripname(namex[i]); 
if (nn > lmax) 

lmax = nn; 
nc[i] = 1; 
ni[i] - 1; 
siz[i] = ij[x] = 0; 
ps[i] = seqx[i]; 
po[i] = out[i]; 



/* matches in core — for checking */ 
/* lengths of stripped file names */ 
/* jmp index for a path */ 
/* number at start of current line */ 
/* current elem number — for gapping */ 

/* ptr to current element */ 
/* ptr to next output char slot */ 
/* output line */ 
/* set by starsQ */ 



pralign 
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for (nn = nm = 0, more = 1; more; ) { ...pralign 
for (i = more = 0; i < 2; i++) { 
/* 

5 * do we have more of this sequence? 

*/ 

if(!*ps[i]) 

continue; 
more++; 

10 if (pp[i].spc) { /* leading space */ 

*po[i] + + = ■ '; 
pp[i],spc-; 

} 

else if (siz[i]) { /* in a gap */ 
15 *po[i] + + = 

siz[i]-; 

} 

else { /* we're putting a seq element 

*/ 

20 *po[i] = *ps[i]; 

if (islower(*ps[i])) 

*ps[i] = toupper(*ps[iJ); 

po[i] ++; 
ps[i] + +; 

25 /* 

* are we at next gap for this seq? 
*/ 

if(ni[i] pp[i].x[ij[i]]) { 

30 * we need to merge all gaps 

* at this location 
*/ 

siz[i] = pp[i].n[ij[i]++]; 
while (ni[ij = = pp[i].x[ij[i]]) 

siz[i] +=pp[i].n[ij[i] + +] ; 

} 

ni[i]+ + ; 

} 

> 

40 if (++nn == olen || Imore && nn) { 

dumpblockO; 
for(i = 0; i < 2; i + +) 
po[i] = outp]; 

nn = 0; 

45 } 



> 

/* 



} 



* dump a block of lines, including numbers, stars: pr alignO 
50 */ 

static 

dumpblockO dumpblock 
register i; 

55 for(i = 0; i < 2; i++) 

*poM~ = 'VO'; 
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(void) puteO\n\ fx); 
for(i = 0;i < 2; i++){ 

if (*out[i] && (*out[i] != 1 ' 1 1 *(po[i]) !=")){ 
if (i ==0) 

nums(i); 
if (i = = 0&& *out[l]) 
starsO; 

putline(i); 

if (i = =0&&*out[l]) 
$>rintf(fx, star); 

if(i= = 1) 

nums(i); 

} 

} 

} 

/* 

* put out a number line: dumpblockQ 
*/ 

static 
nums(ix) 

int ix; /* index in outQ holding seq line */ 

{ 

char nline[P_LINE]; 

register i,j; 

register char *pn, *px, *py; 

for (pn = nline, i = 0; i < Imax+PJSPC; i+ +, pn+ +) 
*pn = • '; 

for (i = nc[ix], py = out[ix]; *py; py+ + , pn++) { 

if(*py«=- || *py 

*pn - * '; 



else { 



if (i%10 == 0 || (i == 1 &&nc[ix] != 1)) { 
j - (i < 0)? -i : i; 
for (px = pn; j; j /= 10, px~) 
*px=j%10 + '0'; 

if (i < 0) 

*px = 



} 

else 

i+ + ; 



*pn 



> 

} 

*pn = *\0'; 
nc[ix] = i; 

for (pn = nline; *pn; pn+ +) 
(void) putc(*pn, fx); 
(void)putcC\n' f fx); 

} 

/* 

* put out a line (name, [num], seq, [num]): dumpblockQ 
*/ 

static 
putline(ix) 

int ix; { 



PCT/US2003/028547 



...dumpblock 



nums 



putline 
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int i; 
register char *px; 

for (px = namex[ix], i = 0; *px && *px != V; px++, i++) 

(void) putc(*px, fx); 
for (; i < lmax+P_SPC; i++) 

(void)putc(' \ fee); 

/* these count from 1: 

* niQ is current element (from 1) 

* ncQ is number at start of current line 
*/ 

for (px = out[ix]; *px; px++) 

(void) putc(*px&0x7F, fx); 
(void) putc('\n\ fee); 



/* 

* put a line of stars (seqs always in out[0], out[l]): dumpblockO 
*/ 

static 
starsO 
{ 

int i; 

register char *p0, *pl, cx, *px; 

if (!*out[0] 1 1 (*out[01 = =«'&& *(po[0]) == ' ') 1 1 
!*out[l] 1 1 (*out[l] =="&& *(po[l]) = =*•)) 
return; 
px = star; 

for (i = lmax+PJSPC; i; i-) 
*px++~= ' '; 

for (pO = out[0], pi - out[l); *p0 && *pl; p0+ +, pl + +) { 
if (Lsalpha(*pO) && isalpha(*pl)) { 

if (xbm^pO-'Altobmr^l-'A']) { 
cx = ■*'; 
nm++; 

} 

else if (!dna&& - day[*pO- , A , ][*pl- , A'] > 0) 



else 

cx = 



} 



cx ~ 
*px+ + = cx; 

} 

*px++ = 'W; 
*px = , \0'; 



•putline 



stars 
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/* 

* strip path or prefix from pn, return len: pr_align0 
*/ 

static 

stripname(pn) Stripname 
char *pn; /* file name (may be path) */ 

{ 

register char *px, *py; 



10 py=0; 

for (px ~ pn; *px; px+ +) 
if (*p X == •/•) 

py = px + 1; 

if(py) 

15 (void) srrcpy(pn, py); 

return(strlen(pn)); 



20 
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/* 

* cleanup 0 — cleanup any tmp file 

* getseqO — read in seq, set ana, len, maxlen 

* g_calloc0 - callocO with error checkin 

* readjmpsO — get the good jmps, from tmp file if necessary 

* writejmpsO - write a filled array of jmps to a tmp file: nwO 
*/ 

^include "nw.h" 
^include <sys/file.h> 



char *jname = "/tmp/homgXXXXXX"; 

FILE *rj; 

hit cleanupO; 

long lseekO; 

/* 

* remove any tmp file if we blow 
*/ 

cleanup(i) 

int i; 

{ 

(void) unlink(jname); 

exit(i); 

} 

/* 

* read, return ptr to seq, set dna, len, maxlen 

* skip lines starting with ';\ 1 < \ or 1 > 1 

* seq in upper or lower case 
*/ 

char * 

getseq(file, len) 

char *file; /* file name */ 
int *len; /* seq len */ 



/* tmp file for jmps */ 
/* cleanup tmp file */ 



cleanup 



getseq 



{ 



char line[1024], *pseq; 

register char *px, *py; 

int natgc, tlen; 

FILE *tp; 

if ((fp = fopen(file,V)) == 0) { 

fprintf(stderr, n %s: can't read %s\n" t prog, file); 

exit(l); 

> 

tlen = natgc = 0; 

while (fgets(line, 1024, fp)) { 

if(*line« ■;' |) *iine - = '<' || *line » '>') 

continue; 
for (px = line; *px != *\n'; px-h-f ) 

if (isupper(*px) 1 1 islower(*px)) 
tlen++; 

} 

if ((pseq = malloc((unsigned)(den+6))) == 0) { 

fprintf(stderr, n %s: mallocO failed to get %d bytes for %s\n", prog, tlen+6, file); 
exit(l); 

} 

pseq[0] = pseq[l] = pseq[2] = pseq[3] = AO'; 
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...getseq 

py = pseq + 4; 
*len = tien; 
rewind(fp); 

5 while (fgetsOine, 1024, rp)) { 

if (*line == ';' || *line == '<' | | *line == '>') 
continue; 

for (px = line; *px != '\n'; px++) { 
if (isupper(*px)) 

10 *py++ = *px; 

i else if (islower(*px)) 

*py+ + = toupper(*px); 
if (mdex( n ATGCU V(py-1))) 
natgc+ + ; 

15 } 
} 

*py++ = '\0'; 

*py = 'VO 1 ; 
(void) fclose(fp); 
20 dna = natgc > (tlen/3); 

return(pseq+4); 

> 

char * 

g_calloc(msg, nx, sz) gjcalloc 
25 char *msg; /* program, calling routine */ 

int nx, sz; /* number and size of elements */ 

{ 

char *px, *callocO; 

if ((px == calloc((unsigned)nx, (unsigned)sz)) == 0) { 
30 if(*msg){ 

Q>rintf(stderr, "%s: g_calloc() foiled %s (n=%d, sz=%d)\n\ prog, msg, nx, sz); 

exit(l); 

} 

35 return(px); 
} 

/* 

* get final jmps from dxQ or tmp file, set ppQ, reset dmax: mainQ 
40 */ 

readjmpsO readjmps 

{ 

int fd = -1; 

int siz, iO, il; 

45 register i, j, xx; 

if(S){ 

(void) fclose(fj); 

if ((fd = open(jname, 0__RDONLY, 0)) < 0) { 

r^rintf(stderr, "%s: can't openO %s\n", prog, jname); 
50 cleanup(l); 

} 

} 

for (i = iO = il = 0, dmaxO = dmax, xx = lenO; ; i+ +) { 
while (1) { 

55 for (j = dx[dmax].ijmp; j > = 0 && dx[dmax].jp.x[j] > = xx; j-) 
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...readjmps 

if (j < 0 && dx[dmax].offeet && tj) { 

(void) lseek(fd, dx[dmax]. offset, 0); 
(void) read(fid, (char *)&dx[dmax] jp, sizeof(struct jmp)); 
5 (void) read(fti, (char *)&dx[dmax] .offset, sizeof(dx[drnax].ofeet)); 

dx[dmax].ijmp = MAXJMP-1; } 

else 

break; } 

if(i> = JMPS){ 

10 fprintf(stderr, n %s: too many gaps in alignments", prog); 

cleanup(l); 

} 

if(j>=0){ 

siz = dx[dmax].jp.n[j]; 
15 xx = dx[dmax].jp.x[j]; 

dmax += siz; 

if (siz < 0) { /* gap in second seq */ 

pp[l].n[il] = -siz; 
xx + = siz; 

20 /* id = xx - yy + lenl - 1 */ 

pp[l].x[il] = xx - dmax + lenl - 1; 

gapy+ + ; 

ngapy -= siz; 
/* ignore MAXGAP when doing endgaps */ 
25 siz = (-siz < MAXGAP | | endgaps)? -siz : MAXGAP; 

il + + ; 

} 

else if (siz > 0) { /* gap in first seq */ 
pp[0].n[i0] = siz; 

30 pp[0].x[i0] = xx; 

gapx+ + ; 
ngapx += siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (siz < MAXGAP 1 1 endgaps)? siz : MAXGAP; 
35 i0+ + ; 

} 

} 



break; 

40 } 

/* reverse the order of jmps */ 
for (j = 0, i0-; j < iO; j + +, i0~) { 

i = PP[0].n[j]; pp[0].n[j] = pp[0].n[i0]; pp[0].n[i0] = i; 

i = pp[0].xQ]; pp[0],x[j] = pp[0].x[i0]; pp[0].x[i0] = i; 

45 } 

for(j =0, il-;j < il;j + + ,il-){ 

i = pp[l].n(j]; pp[l].n[j] = pp[l].n[il]; pp[l].n[il] = i; 
i = pp[l].x[j]; pp[l].x[j] = pp[l].x[il]; pp[l].x[il] = i; 

50 if(fd>=0) 

(void) close(fd); 

tf(5){ 

(void) unlink(jname); 
tj =0; 

55 offset = 0; 

} } 
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/* 

* write a filled jmp struct offset of the prev one (if any): nwO 
*/ 

writejmps(ix) Writejmps 
int ix; 

{ 

char *mktemp0; 
if(!S){ 

if (mktemp(jname) < 0) { 

rprintf(stderr, "%s: can't mktempO %s\n\ prog, jname); 
cleanup(l); 

> 

if ((5 = fopen(jname, V)) == 0) { 

fprintf(stdeiT, "%s: can't write %s\n", prog, jname); 
exit(l); 

} 

} 

(void) fwrite((char *)&dx[ix].jp, sizeof(stmct jmp), 1, fj); 
(void) fwrite((char *)&dx[ix] .offset, sizeof(dx[ix]. offset), 1, fj); 

} 
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TAT XXXXXXXXXXXXXXX (Length = 15 amino acids) 

Comparison Protein XXXXXYYYYYYY (Length = 12 amino acids) 

5 % amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined 
by ALIGN-2) divided by (the total number of amino acid residues of the TAT polypeptide) = 

10 5 divided by 15 = 33.3% 

Table 3 

TAT XXXXXXXXXX (Length = 10 amino acids) 

15 Comparison Protein XXXXXYYYYYYZZYZ (Length = 15 amino acids) 

% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined 
20 by ALIGN-2) divided by (the total number of amino acid residues of the TAT polypeptide) = 

5 divided by 10 = 50% 

Table 4 

25 

TAT-DNA NNNNNNNNNNNNNN (Length = 14 nucleotides) 

Comparison DNA NNNNNNLLLLLLLLLL (Length = 16 nucleotides) 

% nucleic acid sequence identity = 

30 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by 
ALIGN-2) divided by (the total number of nucleotides of the TAT-DNA nucleic acid sequence) = 

6 divided by 14 = 42.9% 
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TAT-DNA NNNNNNNNNNNN (Length = 12 nucleotides) 

Comparison DNA NNNNLLLW (Length = 9 nucleotides) 

5 % nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by 
ALIGN-2) divided by (the total number of nucleotides of the TAT-DNA nucleic acid sequence) = 

10 4 divided by 12 = 33.3% 

II. Compositions and Methods of the Invention 

A. Anti-TAT Antibodies 

In one embodiment, the present invention provides anti-TAT antibodies which may find use herein as 
15 therapeutic and/or diagnostic agents. Exemplary antibodies include polyclonal, monoclonal, humanized, 
bispecific, and heteroconjugate antibodies. 

1. Polyclonal Antibodies 

Polyclonal antibodies are preferably raised in animals by multiple subcutaneous (sc) or intraperitoneal 
(ip) injections of the relevant antigen and an adjuvant. It may be useful to conjugate the relevant antigen 

20 (especially when synthetic peptides are used) to a protein that is immunogenic in the species to be immunized. 

For example, the antigen can be conjugated to keyhole limpet hemocyanin (KLH), serum albumin, bovine 
thyroglobulin, or soybean trypsin inhibitor, using a bifunctional or derivatizing agent, e.g., maleimidobenzoyl 
sulfosuccinimide ester (conjugation through cysteine residues), N-hydroxysuccinimide (through lysine residues), 
glutaraldehyde, succinic anhydride, SOCl 2 , or R l N=C=NR, where R and R 1 are different alkyl groups. 

25 Animals are immunized against the antigen, immunogenic conjugates, or derivatives by combining, 

e.g., 100 ng or 5 [xg of the protein or conjugate (for rabbits or mice, respectively) with 3 volumes of Freund's 
complete adjuvant and injecting the solution intradermally at multiple sites. One month later, the animals are 
boosted with 1/5 to 1/10 the original amount of peptide or conjugate in Freund's complete adjuvant by 
subcutaneous injection at multiple sites. Seven to 14 days later, the animals are bled and the serum is assayed 

30 for antibody titer. Animals are boosted until the titer plateaus. Conjugates also can be made in recombinant 
cell culture as protein fusions. Also, aggregating agents such as alum are suitably used to enhance the immune 
response. 

2. Monoclonal Antibodies 

Monoclonal antibodies may be made using (he hybridoma method first described by Kohler et al., 
35 Nature, 256:495 (1975), or may be made by recombinant DNA methods (U.S. Patent No. 4,816,567). 

In the hybridoma method, a mouse or other appropriate host animal, such as a hamster, is immunized 
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as described above to elicit lymphocytes that produce or are capable of producing antibodies that will specifically 
bind to the protein used for immunization. Alternatively, lymphocytes may be immunized in vitro. After 
immunization, lymphocytes are isolated and then fused with a myeloma cell line using a suitable fusing agent, 
such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice. 
pp.59- 103 (Academic Press, 1986)). 

The hybridoma cells thus prepared are seeded and grown in a suitable culture medium which medium 
preferably contains one or more substances that inhibit the growth or survival of the unfused, parental myeloma 
cells (also referred to as fusion partner). For example, if the parental myeloma cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the selective culture medium for the 
hybridomas typically will include hypoxanthine, aminopterin, and thymidine (HAT medium), which substances 
prevent the growth of HGPRT-deficient cells. 

Preferred fusion partner myeloma cells are those that fuse efficiently, support stable high-level 
production of antibody by the selected antibody-producing cells, and are sensitive to a selective medium that 
selects against die unfused parental cells. Preferred myeloma cell lines are murine myeloma lines, such as those 
derived from MOPC-21 and MPC-11 mouse tumors available from the Salk Institute Cell Distribution Center, 
San Diego, California USA, and SP-2 and derivatives e.g., X63-Ag8-653 cells available from the American 
Type Culture Collection, Manassas, Virginia, USA. Human myeloma and mouse-human heteromyeloma cell 
lines also have been described for the production of human monoclonal antibodies (Kozbor, J. Immunol.. 
133:3001 (1984); andBrodeur et al., Monoclonal Antibody Production Techniques and Applications, pp. 51-63 
(Marcel Dekker, Inc., New York, 1987)). 

Culture medium in which hybridoma cells are growing is assayed for production of monoclonal 
antibodies directed against the antigen. Preferably, the binding specificity of monoclonal antibodies produced 
by hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunosorbent assay (ELISA). 

The binding affinity of the monoclonal antibody can, for example, be determined by the Scatchard 
analysis described in Munson et al., Anal. Biochem.. 107:220 (1980). 

Once hybridoma cells that produce antibodies of the desired specificity, affinity, and/or activity are 
identified, the clones may be subcloned by limiting dilution procedures and grown by standard methods (Goding, 
Monoclonal Antibodies: Principles and Practice, pp.59-103 (Academic Press, 1986)). Suitable culture media 
for this purpose include, for example, D-MEM or RPMI-1640 medium. In addition, die hybridoma cells may 
be grown in vivo as ascites tumors in an animal e.g,, by i.p. injection of the cells into mice. 

The monoclonal antibodies secreted by the subclones are suitably separated from the culture medium, 
ascites fluid, or serum by conventional antibody purification procedures such as, for example, affinity 
chromatography (e.g., using protein A or protein G-Sepharose) or ion-exchange chromatography, 
hydroxylapatite chromatography, gel electrophoresis, dialysis, etc. 

DNA encoding the monoclonal antibodies is readily isolated and sequenced using conventional 
procedures (e.g., by using oligonucleotide probes that are capable of binding specifically to genes encoding the 
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heavy and light chains of murine antibodies). The hybridoma cells serve as a preferred source of such DNA. 
Once isolated, the DNA may be placed into expression vectors, which are then transfected into host cells such 
as E. coll cells, simian COS cells, Chinese Hamster Ovary (CHO) cells, or myeloma cells that do not otherwise 
produce antibody protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells . Review 
articles on recombinant expression in bacteria of DNA encoding the antibody include Skerra et al., Curr. 
5 Opinion in Immunol.. 5:256-262 (1993) and Pluckthun, Immunol. Revs. 130:151-188 (1992). 

In a further embodiment, monoclonal antibodies or antibody fragments can be isolated from antibody 
phage libraries generated using the techniques described in McCafferty et al., Nature. 348:552-554 (1990). 
Clackson et al., Nature. 352:624-628 (1991) and Marks et al., J. Mol. Biol.. 222:581-597 (1991) describe the 
isolation of murine and human antibodies, respectively, using phage libraries. Subsequent publications describe 
10 the production of high affinity (nM range) human antibodies by chain shuffling (Marks et al. , Bio/Technology , 
10:779-783 (1992)), as well as combinatorial infection and in vivo recombination as a strategy for constructing 
very large phage libraries (Waterhouse et al. , Nuc. Acids. Res. 21 :2265-2266 (1993)). Thus, these techniques 
are viable alternatives to traditional monoclonal antibody hybridoma techniques for isolation of monoclonal 
antibodies. 

15 The DNA that encodes the antibody may be modified to produce chimeric or fusion antibody 

polypeptides, for example, by substituting human heavy chain and light chain constant domain (C H and CJ 
sequences for the homologous murine sequences (U.S. Patent No. 4,816,567; and Morrison, et al., Proc. Natl 
Acad. Sci. USA. 81:6851 (1984)), or by fusing the immunoglobulin coding sequence with all or part of the 
coding sequence for a non-immunoglobulin polypeptide (heterologous polypeptide). The non-immunoglobulin 

20 polypeptide sequences can substitute for the constant domains of an antibody, or they are substituted for the 
variable domains of one antigen-combining site of an antibody to create a chimeric bivalent antibody comprising 
one antigen-combining site having specificity for an antigen and another antigen-combining site having 
specificity for a different antigen. 

3. Human and Humanized Antibodies 

25 The anti-TAT antibodies of the invention may further comprise humanized antibodies or human 

antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab') 2 or other antigen-binding 
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. 
Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a 

3 0 complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non-human 
species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. In 
some instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Humanized antibodies may also comprise residues which are found neither in the recipient antibody 
nor in the imported CDR or framework sequences. In general, the humanized antibody will comprise 

3 5 substantially all of at least one, and typically two, variable domains , in which all or substantially all of the CDR 
regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are 
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those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin [Jones 
et al. f Nature. 321:522-525 (1986); Riechmann et al., Nature. 332:323-329 (1988); and Presta, Curr. Op. 
Struct. Biol.. 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized 
5 antibody has one or more amino acid residues introduced into it from a source which is non-human. These non- 
human amino acid residues are often referred to as "import" residues, which are typically taken from an 
" import " variable domain. Humanization can be essentially performed following the method of Winter and co- 
workers [Jones et al., Nature. 321:522-525 (1986); Riechmann et al., Nature. 332:323-327 (1988); Verhoeyen 
et al., Science. 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the corresponding 

10 sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Patent 
No. 4,816,567), wherein substantially less than an intact human variable domain has been substituted by the 
corresponding sequence from a non-human species. In practice, humanized antibodies are typically human 
antibodies in which some CDR residues and possibly some FR residues are substituted by residues from 
analogous sites in rodent antibodies. 

15 The choice of human variable domains, both light and heavy, to be used in making the humanized 

antibodies is very important to reduce antigenicity and HAMA response (human anti-mouse antibody) when the 
antibody is intended for human therapeutic use. According to the so-called "best-fit" method, the sequence of 
the variable domain of a rodent antibody is screened against the entire library of known human variable domain 
sequences. The human V domain sequence which is closest to that of the rodent is identified and the human 

20 framework region (FR) within it accepted for the humanized antibody (Sims et al., J. Immunol. 151:2296 
(1993); Chothia et al., J. Mol. Biol.. 196:901 (1987)). Another method uses a particular framework region 
derived from the consensus sequence of all human antibodies of a particular subgroup of light or heavy chains. 
The same framework may be used for several different humanized antibodies (Carter et al., Proc. Natl. Acad. 
Sci. USA. 89:4285 (1992); Presta et al., J. Immunol. 151:2623 (1993)). 

25 It is further important that antibodies be humanized with retention of high binding affinity for the 

antigen and other favorable biological properties. To achieve this goal, according to a preferred method, 
humanized antibodies are prepared by a process of analysis of the parental sequences and various conceptual 
humanized products using three-dimensional models of the parental and humanized sequences. Three- 
dimensional immunoglobulin models are commonly available and are familiar to those skilled in the art. 

30 Computer programs are available which illustrate and display probable three-dimensional conformational 
structures of selected candidate immunoglobulin sequences. Inspection of these displays permits analysis of the 
likely role of the residues in the functioning of the candidate immunoglobulin sequence, i.e., the analysis of 
residues that influence the ability of the candidate immunoglobulin to bind its antigen. In this way, FR residues 
can be selected and combined from the recipient and import sequences so that the desired antibody 

35 characteristic, such as increased affinity for the target antigen(s), is achieved. In general, the hypervariable 
region residues are directly and most substantially involved in influencing antigen binding. 
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Various forms of a humanized anti-TAT antibody are contemplated. For example, the humanized 
antibody may be an antibody fragment, such as a Fab, which is optionally conjugated with one or more cytotoxic 
agent(s) in order to generate an immunoconjugate. Alternatively, the humanized antibody may be an intact 
antibody, such as an intact IgGl antibody. 

As an alternative to humanization, human antibodies can be generated. For example, it is now possible 
to produce transgenic animals (e.g., mice) that are capable, upon immunization, of producing a full repertoire 
of human antibodies in the absence of endogenous immunoglobulin production. For example, it has been 
described that the homozygous deletion of the antibody heavy-chain joining region (J H ) gene in chimeric and 
germ-line mutant mice results in complete inhibition of endogenous antibody production. Transfer of the human 
germ-line immunoglobulin gene array into such germ-line mutant mice will result in the production of human 
antibodies upon antigen challenge. See, e.g., Jakobovits et al., Proc. Natl. Acad. Sci. USA. 90:2551 (1993); 
Jakobovits et al., Nature, 362:255-258 (1993); Bruggemann et al., Year in Immuno. 7:33 (1993); U.S. Patent 
Nos. 5,545,806, 5,569,825, 5,591,669 (all of GenPharm); 5,545,807; and WO 97/17852. 

Alternatively, phage display technology (McCafferty et al., Nature 348:552-553 [1990]) can be used 
to produce human antibodies and antibody fragments in vitro, from immunoglobulin variable (V) domain gene 
repertoires from unimmunized donors. According to this technique, antibody V domain genes are cloned in- 
frame into either a major or minor coat protein gene of a filamentous bacteriophage, such as M13 or fd, and 
displayed as functional antibody fragments on the surface of the phage particle. Because the filamentous particle 
contains a single-stranded DNA copy of the phage genome, selections based on the functional properties of the 
antibody also result in selection of the gene encoding the antibody exhibiting those properties. Thus, the phage 
mimics some of the properties of the B-cell. Phage display can be performed in a variety of formats, reviewed 
in, e.g., Johnson, Kevin S. and Chiswell, David J., Current Opinion in Structural Biology 3:564-571 (1993). 
Several sources of V-gene segments can be used for phage display. Clackson et al. Nature. 352:624-628 (1991) 
isolated a diverse array of anti-oxazolone antibodies from a small random combinatorial library of V genes 
derived from the spleens of i mm u ni zed mice. A repertoire of V genes from unimmunized human donors can 
be constructed and antibodies to a diverse array of antigens (including self-antigens) can be isolated essentially 
following the techniques described by Marks et al. , J. Mol. Biol. 222:581-597 (1991), or Griffith et al. , EMBO 
L 12:725-734 (1993). See, also, U.S. Patent Nos. 5,565,332 and 5,573,905. 

As discussed above, human antibodies may also be generated by in vitro activated B cells (see U.S. 
Patents 5,567,610 and 5,229,275). 

4. Antibody fragments 

In certain circumstances there are advantages of using antibody fragments, rather than whole 
antibodies. The smaller size of the fragments allows for rapid clearance, and may lead to improved access to 
solid tumors. 

Various techniques have been developed for the production of antibody fragments. Traditionally, these 
fragments were derived via proteolytic digestion of intact antibodies (see, e.g., Morimoto et al., Journal of 
Biochemical and Biophysical Methods 24:107.117 (1992); and Brennan et al., Science. 229:81 (1985)). 
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However, these fragments can now be produced directly by recombinant host cells. Fab, Fv and ScFv antibody 
fragments can all be expressed in and secreted from E. coli, thus allowing the facile production of large amounts 
of these fragments. Antibody fragments can be isolated from the antibody phage libraries discussed above. 
Alternatively, Fab'-SH fragments can be directly recovered from E. coli and chemically coupled to form F(ab') 2 
fragments (Carter et al., Bio/Technology 10:163-167 (1992)). According to another approach, F(ab') 2 
fragments can be isolated directly from recombinant host cell culture. Fab and F(ab'i fragment with increased 
in vivo half-life comprising a salvage receptor binding epitope residues are described in U.S. Patent No. 
5,869,046. Other techniques for the production of antibody fragments will be apparent to the skilled 
practitioner. In other embodiments, the antibody of choice is a single chain Fv fragment (scFv). See WO 
93/16185; U.S. Patent No. 5,571,894; and U.S. Patent No. 5,587,458. Fv and sFv are the only species with 
intact combining sites that are devoid of constant regions; thus, they are suitable for reduced nonspecific binding 
during in vivo use. sFv fusion proteins may be constructed to yield fusion of an effector protein at either the 
amino or the carboxy terminus of an sFv. See Antibody Engineering ed. Borrebaeck, supra. The antibody 
fragment may also be a "linear antibody", e.g., as described in U.S. Patent 5,641 ,870 for example. Such linear 
antibody fragments may be monospecific or bispecific. 

5. Bispecific Antibodies 
Bispecific antibodies are antibodies that have binding specificities for at least two different epitopes. 
Exemplary bispecific antibodies may bind to two different epitopes of a TAT protein as described herein. Other 
such antibodies may combine a TAT binding site with a binding site for another protein. Alternatively, an anti- 
TAT arm may be combined with an arm which binds to a triggering molecule on a leukocyte such as a T-cell 
receptor molecule (e.g. CD3), or Fc receptors for IgG (Fc Y R), such as FcyRI (CD64), Fc Y RII (CD32) and 
FcyPJII (CD16), so as to focus and localize cellular defense mechanisms to the TAT-expressing cell. Bispecific 
antibodies may also be used to localize cytotoxic agents to cells which express TAT. These antibodies possess 
a TAT-binding arm and an arm which binds the cytotoxic agent (e.g. , saporin, anti-interferon-a, vinca alkaloid, 
ricin A chain, methotrexate or radioactive isotope hapten). Bispecific antibodies can be prepared as full length 
antibodies or antibody fragments (e.g., F(ab') 2 bispecific antibodies). 

WO 96/16673 describes a bispecific anti-ErbB2/anti-FcYRin antibody and U.S. Patent No. 5,837,234 
discloses a bispecific anti-ErbB2/anti-Fc Y RI antibody. A bispecific anti-ErbB2/Fc a antibody is shown in 
WO98/02463. U.S. Patent No. 5,821,337 teaches a bispecific anti-ErbB2/anti-CD3 antibody. 

Methods for making bispecific antibodies are known in the art. Traditional production of full length 
bispecific antibodies is based on the co-expression of two immunoglobulin heavy chain-light chain pairs, where 
the two chains have different specificities (Millstein et al. , Nature 305:537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a potential 
mixture of 10 different antibody molecules, of which only one has the correct bispecific structure. Purification 
of the correct molecule, which is usually done by affinity chromatography steps, is rather cumbersome, and the 
product yields are low. Similar procedures are disclosed in WO 93/08829, and in Traunecker et al., EMBO 
L. 10:3655-3659 (1991). 
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According to a different approach, antibody variable domains with the desired binding specificities 
(antibody-antigen combining sites) are fused to immunoglobulin constant domain sequences. Preferably, the 
fusion is with an Ig heavy chain constant domain, comprising at least part of the hinge, C H 2, and C H 3 regions. 
It is preferred to have the first heavy-chain constant region (C H 1) containing the site necessary for tight chain 
bonding, present in at least one of the fusions. DNAs encoding the immunoglobulin heavy chain fusions and, 
if desired, the immunoglobulin light chain, are inserted into separate expression vectors, and are co-transfected 
into a suitable host cell. This provides for greater flexibility in adjusting the mutual proportions of the three 
polypeptide fragments in embodiments when unequal ratios of the three polypeptide chains used in the 
construction provide the optimum yield of the desired bispecific antibody. It is, however, possible to insert the 
coding sequences for two or all three polypeptide chains into a single expression vector when the expression of 
at least two polypeptide chains in equal ratios results in high yields or when the ratios have no significant affect 
on the yield of the desired chain combination. 

In a preferred embodiment of this approach, the bispecific antibodies are composed of a hybrid 
immunoglobulin heavy chain with a first binding specificity in one arm, and a hybrid immunoglobulin heavy 
chain-light chain pair (providing a second binding specificity) in the other arm. It was found that this 
asymmetric structure facilitates the separation of the desired bispecific compound from unwanted 
immunoglobulin chain combinations, as the presence of an immunoglobulin light chain in only one half of the 
bispecific molecule provides for a facile way of separation. This approach is disclosed in WO 94/04690. For 
further details of generating bispecific antibodies see, for example, Suresh et al., Methods in Enzvmolnp v 
121:210(1986). 

According to another approach described in U.S. Patent No. 5,731,168, the interface between a pair 
of antibody molecules can be engineered to maximize the percentage of heterodimers which are recovered from 
recombinant cell culture. The preferred interface comprises at least a part of the C H 3 domain. In this method, 
one or more small amino acid side chains from the interface of the first antibody molecule are replaced with 
larger side chains (e.g., tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the 
large side chain(s) are created on the interface of the second antibody molecule by replacing large amino acid 
side chains with smaller ones (e.g., alanine or threonine). This provides a mechanism for increasing the yield 
of the heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies include cross-linked or "heteroconjugate" antibodies. For example, one of the 
antibodies in the heteroconjugate can be coupled to avidin, the other to biotin. Such antibodies have, for 
example, been proposed to target immune system cells to unwanted cells (U.S. Patent No. 4,676,980), and for 
treatment of HIV infection (WO 91/00360, WO 92/200373, and EP 03089). Heteroconjugate antibodies may 
be made using any convenient cross-linking methods. Suitable cross-linking agents are well known in the art, 
and are disclosed in U.S. Patent No. 4,676,980, along with a number of cross-linking techniques. 

Techniques for generating bispecific antibodies from antibody fragments have also been described in 
the literature. For example, bispecific antibodies can be prepared using chemical linkage. Brennan et al. , 
Science 229:81 (1985) describe a procedure wherein intact antibodies are proteolytically cleaved to generate 
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F(ab')2 fragments. These fragments are reduced in the presence of the dithiol completing agent, sodium 
arsenite, to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments 
generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab*-TNB derivatives is then 
reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is mixed with an equimolar amount of 
the other Fab' -TNB derivative to form the bispecific antibody. The bispecific antibodies produced can be used 
5 as agents for the selective immobilization of enzymes. 

Recent progress has facilitated the direct recovery of Fab'-SH fragments from E. coli, which can be 
chemically coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175: 217-225 (1992) describe 
the production of a fully humanized bispecific antibody F(aV)2 molecule. Each Fab 1 fragment was separately 
secreted from E. coli and subjected to directed chemical coupling in vitro to form the bispecific antibody. The 
1 0 bispecific antibody thus formed was able to bind to cells overexpressing the ErbB2 receptor and normal human 

T cells, as well as trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor targets. 

"i 

Various techniques for making and isolating bispecific antibody fragments directly from recombinant 
cell culture have also been described. For example, bispecific antibodies have been produced using leucine 
zippers. Kostelny et al. J. Immunol. 148(5): 1547-1553 (1992). The leucine zipper peptides from the Fos and 

15 Jun proteins were linked to the Fab' portions of two different antibodies by gene fusion. The antibody 

homodimers were reduced at the hinge region to form monomers and then re-oxidized to form the antibody 
heterodimers. This method can also be utilized for the production of antibody homodimers. The "diabody" 
technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an 
alternative mechanism for making bispecific antibody fragments. The fragments comprise a V H connected to 

20 a V L by a linker which is too short to allow pairing between the two domains on the same chain. Accordingly, 
the V H and V L domains of one fragment are forced to pair with the complementary V L and V H domains of 
another fragment, thereby forming two antigen-binding sites. Another strategy for making bispecific antibody 
fragments by the use of single-chain Fv (sFv) dimers has also been reported. See Gruber et al.. J. Immunol.. 
152:5368 (1994). 

25 Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can 

be prepared. Tutt et al.. J. Immunol. 147:60 (1991). 

6. Heteroconiugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate 
antibodies are composed oftwocovalently joined antibodies. Such antibodies have, for example, been proposed 

30 to target immune system cells to unwanted cells [U.S. Patent No. 4,676,980], and for treatment of HIV 

infection [WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may be prepared 
in vitro using known methods in synthetic protein chemistry, including those involving crosslinking agents. For 
example, immunotoxins may be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4-mercaptobutyrimidate and 

35 those disclosed, for example, in U.S. Patent No. 4,676,980. 

7. Multivalent Antibodies 
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A multivalent antibody may be internalized (and/or catabolized) faster than a bivalent antibody by a 
cell expressing an antigen to which the antibodies bind. The antibodies of the present invention can be 
multivalent antibodies (which are other than of the IgM class) with three or more antigen binding sites (e.g. 
tetravalent antibodies), which can be readily produced by recombinant expression of nucleic acid encoding the 
polypeptide chains of the antibody. The multivalent antibody can comprise a dimerization domain and three or 
5 more antigen binding sites. The preferred dimerization domain comprises (or consists of) an Fc region or a 
hinge region. In this scenario, the antibody will comprise an Fc region and three or more antigen binding sites 
amino-terminal to the Fc region. The preferred multivalent antibody herein comprises (or consists of) three to 
about eight, but preferably four, antigen binding sites. The multivalent antibody comprises at least one 
polypeptide chain (and preferably two polypeptide chains), wherein the polypeptide chain(s) comprise two or 

10 more variable domains. For instance, the polypeptide chain(s) may comprise VD1-(X1) n -VD2-(X2) n -Fc, 
wherein VD1 is a first variable domain, VD2 is a second variable domain, Fc is one polypeptide chain of an 
Fc region, XI and X2 represent an amino acid or polypeptide, and n is 0 or 1. For instance, the polypeptide 
chain(s) may comprise: VH-CH1 -flexible linker-VH-CHl-Fc region chain; or VH-CHl-VH-CHl-Fc region 
chain. The multivalent antibody herein preferably further comprises at least two (and preferably four) light chain 

1 5 variable domain polypeptides. The multivalent antibody herein may, for instance, comprise from about two to 
about eight light chain variable domain polypeptides. The light chain variable domain polypeptides contemplated 
here comprise a light chain variable domain and, optionally, further comprise a CL domain. 
8. Effector Function Engineering 
It may be desirable to modify the antibody of the invention with respect to effector function, e.g., so 

20 as to enhance antigen-dependent cell-mediated cyotoxicity (ADCC) and/or complement dependent cytotoxicity 
(CDC) of the antibody. This may be achieved by introducing one or more amino acid substitutions in an Fc 
region of the antibody. Alternatively or additionally, cysteine residue(s) may be introduced in the Fc region, 
thereby allowing interchain disulfide bond formation in this region. The homodimeric antibody thus generated 
may have improved internalization capability and/or increased complement-mediated cell killing and antibody- 

25 dependent cellular cytotoxicity (ADCC). See Caron et al. J. Exp Med. 176:1191-1195 (1992) and Shopes, B. 

J. Immunol. 148:2918-2922 (1992). Homodimeric antibodies with enhanced anti-tumor activity may also be 
prepared using heterobifunctional cross-linkers as described in Wolff et al., Cancer Research 53:2560-2565 
(1993). Alternatively, an antibody can be engineered which has dual Fc regions and may thereby have enhanced 
complement lysis and ADCC capabilities. See Stevenson et al., Anti-Cancer Drug Design 3:219-230 (1989). 

30 To increase the serum half life of the antibody, one may incorporate a salvage receptor binding epitope 

into the antibody (especially an antibody fragment) as described in U.S. Patent 5,739,277, for example. As used 
herein, the term "salvage receptor binding epitope" refers to an epitope of the Fc region of an IgG molecule 
(e.g., IgG lf IgG 2 , IgG 3 , or IgG 4 ) that is responsible for increasing the in vivo serum half-life of the IgG 
molecule. 

35 9. Immunoconjugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic 
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agent such as a chemotherapeutic agent, a growth inhibitory agent, a toxin (e.g. , an enzymatically active toxin 
of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope ( i.e., a 
radioconjugate) . 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been described 
above. Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, 
nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A 
chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca 
americana proteins (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria 
officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety 
of radionuclides are available for the production of radioconjugated antibodies. Examples include 2I2 Bi, 13l I, 
131 In, 90 Y 9 and 186 Re. Conjugates of the antibody and cytotoxic agent are made using a variety of bifimctional 
protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), 
bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as disuccinimidyl 
suberate), aldehydes (such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) 
hexanediamine), bis-diazonium derivatives (such as bis-(p-diazoniumben2oyl)-ethylenediamine), diisocyanates 
(such as tolyene2,6-diisocyanate), and bis-active fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene) 
For example, a ricin immunotoxin can be prepared as described in Vitetta et al. , Science, 238: 1098 (1987). 
Carbon-14-labeled l-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an 
exemplary chelating agent for conjugation of radionucleotide to the antibody. See WO94/11026. 

Conjugates of an antibody and one or more small molecule toxins, such as a calicheamicin, 
maytansinoids, a trichothene, and CC1065, and the derivatives of these toxins that have toxin activity, are also 
contemplated herein. 
Maytansine and maytansinoids 

In one preferred embodiment, an anti-TAT antibody (full length or fragments) of the invention is 
conjugated to one or more maytansinoid molecules. 

Maytansinoids are mitototic inhibitors which act by inhibiting tubulin polymerization. Maytansine was 
first isolated from the east African shrub Maytenus serrata (U.S. Patent No. 3,896, 1 1 1). Subsequently, it was 
discovered that certain microbes also produce maytansinoids, such as maytansinol and C-3 maytansinol esters 
(U.S. Patent No. 4, 151,042). Synthetic maytansinol and derivatives and analogues thereof are disclosed, for 
example, in U.S. Patent Nos. 4,137,230; 4,248,870; 4,256,746; 4,260,608; 4,265,814; 4,294,757; 4,307,016; 
4,308,268; 4,308,269; 4,309,428; 4,313,946; 4,315,929; 4,317,821; 4,322,348; 4,331,598; 4,361,650; 
4,364,866; 4,424,219; 4,450,254; 4,362,663; and 4,371,533, the disclosures of which are hereby expressly 
incorporated by reference. 
Mavtansinoid-antibodv conjugates 

In an attempt to improve their therapeutic index, maytansine and maytansinoids have been conjugated 
to antibodies specifically binding to tumor cell antigens. Immunoconjugates containing maytansinoids and their 
therapeutic use are disclosed, for example, in U.S. Patent Nos. 5,208,020, 5,416,064 and European Patent EP 
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0 425 235 Bl , the disclosures of which are hereby expressly incorporated by reference. Liu et al.. Proc. Natl. 
Acad. Sci. USA 93:8618-8623 (1996) described immunoconjugates comprising a maytansinoid designated DM 1 
linked to the monoclonal antibody C242 directed against human colorectal cancer. The conjugate was found 
to be highly cytotoxic towards cultured colon cancer cells, and showed antitumor activity in an in vivo tumor 
growth assay. Charietal., Cancer Research 52:127-131 (1992) describe immunoconjugates in which a 
5 maytansinoid was conjugated via a disulfide linker to the murine antibody A7 binding to an antigen on human 
colon cancer cell lines, or to another murine monoclonal antibody TA.l that binds the HER-2/neu oncogene. 
The cytotoxicity of the TA. 1-maytansonoid conjugate was tested in vitro on the human breast cancer cell line 
SK-BR-3, which expresses 3 x 10 5 HER-2 surface antigens per cell. The drug conjugate achieved a degree of 
cytotoxicity similar to the free maytansonid drug, which could be increased by increasing the number of 
1 0 maytansinoid molecules per antibody molecule. The A7-maytansinoid conjugate showed low systemic 
cytotoxicity in mice. 

Anti-TAT polypeptide antibodv-mavtansinoid conjugates (immunoconjugates) 

Anti-TAT antibody-maytansinoid conjugates are prepared by chemically linking an anti-TAT antibody 
to a maytansinoid molecule without significantly diminishing the biological activity of either the antibody or the 

15 maytansinoid molecule. An average of 3-4 maytansinoid molecules conjugated per antibody molecule has 

shown efficacy in enhancing cytotoxicity of target cells without negatively affecting the function or solubility 
of the antibody, although even one molecule of toxin/antibody would be expected to enhance cytotoxicity over 
the use of naked antibody. Maytansinoids are well known in the art and can be synthesized by known techniques 
or isolated from natural sources. Suitable maytansinoids are disclosed, for example, in U.S. Patent No. 

20 5,208,020 and in the other patents and nonpatent publications referred to hereinabove. Preferred maytansinoids 
are maytansinol and maytansinol analogues modified in the aromatic ring or at other positions of the maytansinol 
molecule, such as various maytansinol esters. 

There are many linking groups known in the art for making antibody-maytansinoid conjugates, 
including, for example, those disclosed in U.S. Patent No. 5,208,020 or EP Patent 0 425 235 Bl, and Chari 

25 et al., Cancer Research 52:127-131 (1992). The linking groups include disufide groups, thioether groups, acid 
labile groups, photolabile groups, peptidase labile groups, or esterase labile groups, as disclosed in the above- v 
identified patents, disulfide and thioether groups being preferred. 

Conjugates of the antibody and maytansinoid may be made using a variety of bifimctional protein 
coupling agents such as N-succinimidyl-3-(2-pyridyldithio) propionate (SPDP), succinimidyl-4-(N- 

30 maleimidomethyl) cyclohexane-l-carboxylate, iminothiolane (IT), bifimctional derivatives of imidoesters (such 
as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives 
(such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as toluene 2,6-diisocyanate), and bis- 
active fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). Particularly preferred coupling agents 

35 include N-succinimidyl-3-(2-pyridylditbio) propionate (SPDP) (Carlsson et al. Biochem. J. 173:723-737 [1978]) 
and N-succinimidyl-4-(2-pyridylthio)pentanoate (SPP) to provide for a disulfide linkage. 
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The linker may be attached to the maytansinoid molecule at various positions, depending on the type 
of the link. For example, an ester linkage may be formed by reaction with a hydroxyl group using conventional 
coupling techniques. The reaction may occur at the C-3 position having a hydroxyl group, the C-14 position 
modified with hyrdoxymethyl, the C-15 position modified with a hydroxyl group, and the C-20 position having 
a hydroxyl group. In a preferred embodiment, the linkage is formed at die C-3 position of maytansinol or a 
5 maytansinol analogue. 
Calicheamicin 

Another immunoconjugate of interest comprises an anti-TAT antibody conjugated to one or more 
calicheamicin molecules. The calicheamicin family of antibiotics are capable of producing double-stranded 
DNA breaks at sub-picomolar concentrations. For the preparation of conjugates of the calicheamicin family, 

10 see U.S. patents 5,712,374, 5,714,586, 5,739,116, 5,767,285, 5,770,701, 5,770,710, 5,773,001, 5,877,296 
(all to American Cyanamid Company). Structural analogues of calicheamicin which may be used include, but 
are not limited to, y/, o^ 1 , 0C3 1 , N-acetyl-y/, PSAG and B\ (Hinman et al., Cancer Research 53:3336-3342 
(1993), Lode et al., Cancer Research 58:2925-2323 (1998) and the aforementioned U.S. patents to American 
Cyanamid). Another anti-tumor drug that the antibody can be conjugated is QFA which is an antifolate. Both 

15 calicheamicin and QFA have intracellular sites of action and do not readily cross the plasma membrane. 

Therefore, cellular uptake of these agents through antibody mediated internalization greatly enhances their 
cytotoxic effects. 
Other cytotoxic agents 

Other antitumor agents that can be conjugated to the anti-TAT antibodies of the invention include 
20 BCNU, streptozoicin, vincristine and 5-fluorouracil, the family of agents known collectively LL-E33288 
complex described in U.S. patents 5,053,394, 5,770,710, as well as esperamicins (U.S. patent 5,877,296). 

Enzymatically active toxins and fragments thereof which can be used include diphtheria A chain, 
nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A 
25 chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca 
ainericana proteins (PAPI, PAPH, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaoharia 
officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin and the tricothecenes. See, for 
example, WO 93/21232 published October 28, 1993. 

The present invention further contemplates an immunoconjugate formed between an antibody and a 
30 compound with nucleolytic activity (e.g. , a ribonuclease or a DNA endonuclease such as a deoxyribonuclease; 
DNase). 

For selective destruction of the tumor, the antibody may comprise a highly radioactive atom. A variety 
of radioactive isotopes are available for the production of radioconjugated anti-TAT antibodies. Examples 
include At 211 , 1 131 , 1 125 , Y 90 , Re 186 , Re 188 , Sm 153 , Bi 212 , P 32 , Pb 212 and radioactive isotopes of Lu. When the 
35 conjugate is used for diagnosis, it may comprise a radioactive atom for scintigraphic studies, for example tc 99m 
or I m > or a spin label for nuclear magnetic resonance (NMR) imaging (also known as magnetic resonance 
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imaging, mri), such as iodine- 123 again, iodine-131, indium-111, fluorine-19, carbon-13, nitrogen-15, oxygen- 
17, gadolinium, manganese or iron. 

The radio- or other labels may be incorporated in the conjugate in known ways. For example, the 
peptide may be biosynthesized or may be synthesized by chemical amino acid synthesis using suitable amino 
acid precursors involving, for example, fluorine-19 in place of hydrogen. Labels such as fa? 9 ™ or I 123 , .Re 186 , 
5 Re 188 and In 111 can be attached via a cysteine residue in the peptide. Yttrium-90 can be attached via a lysine 
residue. The IODOGEN method (Fraker et al (1978) Biochem. Biophys. Res. Commun. 80: 49-57 can be used 
to incorporate iodine-123. "Monoclonal Antibodies inlmmunoscintigraphy" (Chatal,CRC Press 1989) describes 
other methods in detail. 

Conjugates of the antibody and cytotoxic agent may be made using a variety of bifunctional protein 

10 coupling agents such as N-succinimidyl-3-(2-pyridyldithio) propionate (SPDP), succinimidyl-4-(N- 

maleimidomethyl) cyclohexane-l-carboxylate, iminothiolane (IT), bifunctional derivatives of imidoesters (such 
as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives 
(such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), and bis- 

15 active fluorine compounds (such as l,5-difluoro-2,4-dimtrobenzene). For example, a ricin immunotoxin can 
be prepared as described in Vitetta et al., Science 238: 1098 (1987). Carbon- 14-labeled 1-isothiocyanatobenzyl- 
3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for conjugation of 
radionucleotide to the antibody. See WO94/11026. The linker may be a "cleavable linker" facilitating release 
of the cytotoxic drug in the cell. For example, an acid-labile linker, peptidase-sensitive linker, photolabile 

20 linker, dimethyl linker or disulflde-containing linker (Chari et al., Cancer Research 52:127-131 (1992); U.S. 
Patent No. 5,208,020) may be used. 

Alternatively, a fusion protein comprising the anti-TAT antibody and cytotoxic agent may be made, 
e.g., by recombinant techniques or peptide synthesis. The length of DNA may comprise respective regions 
encoding the two portions of the conjugate either adjacent one another or separated by a region encoding a linker 

25 peptide which does not destroy the desired properties of the conjugate. 

In yet another embodiment, the antibody may be conjugated to a "receptor" (such streptavidin) for 
utilization in tumor pre-targeting wherein the antibody-receptor conjugate is administered to the patient, followed 
by removal of unbound conjugate from the circulation using a clearing agent and then administration of a 
"ligand" (e.g., avidin) which is conjugated to a cytotoxic agent (e.g., a radionucleotide). 

30 10. Immunoliposomes 

The anti-TAT antibodies disclosed herein may also be formulated as immunoliposomes. A "liposome" 
is a small vesicle composed of various types of lipids, phospholipids and/or surfactant which is useful for 
delivery of a drug to a mammal. The components of the liposome are commonly arranged in a bilayer 
formation, similar to the lipid arrangement of biological membranes. Liposomes containing the antibody are 

35 prepared by methods known in the art, such as described in Epstein et al. , Proc. Natl. Acad. Sci. USA 82:3688 
(1985); Hwang et al.. Proc. Natl Acad. Sci. USA 77:4030 (1980); U.S. Pat. Nos. 4,485,045 and 4,544,545; 
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and W097/38731 published October 23, 1997. Liposomes with enhanced circulation time are disclosed in U.S. 
Patent No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse phase evaporation method with a lipid 
composition comprising phosphatidylcholine, cholesterol and PEG-derivatized phosphatidyiethanolamine (PEG- 
PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired diameter. 
5 Fab' fragments of the antibody of the present invention can be conjugated to the liposomes as described in 

Martin et al., J. Biol. Chem. 257:286-288 (1982) via a disulfide interchange reaction. A chemotherapeutic 
agent is optionally contained within the liposome. See Gabizon et al., J. National Cancer Inst. 81(19): 1484 
(1989). 

B. TAT Binding Oligopeptides 

1 0 TAT binding oligopeptides of the present invention are oligopeptides that bind, preferably specifically, 

to a TAT polypeptide as described herein. TAT binding oligopeptides may be chemically synthesized using 
known oligopeptide synthesis methodology or may be prepared and purified using recombinant technology. 
TAT binding oligopeptides are usually at least about 5 amino acids in length, alternatively at least about 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 

15 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 
65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 
93, 94, 95, 96, 97, 98, 99, or 100 amino acids in length or more, wherein such oligopeptides that are capable 
of binding, preferably specifically, to a TAT polypeptide as described herein. TAT binding oligopeptides may 
be identified without undue experimentation using well known techniques. In this regard, it is noted that 

20 techniques for screening oligopeptide libraries for oligopeptides that are capable of specifically binding to a 
polypeptide target are well known in the art (see, e.g., U.S. Patent Nos. 5,556,762, 5,750,373, 4,708,871, 
4,833,092, 5,223,409 , 5,403,484, 5,571,689, 5,663,143; PCT Publication Nos. WO 84/03506 and 
WO84/03564; Geysenetal., Proc. Natl. Acad. Sci. U.S.A., 81:3998-4002(1984); Geysenetal., Proc. Natl. 
Acad. Sci. U.S.A., 82:178-182 (1985); , Geysen et al., in Synthetic Peptides as Antigens, 130-149 (1986); 

25 Geysen et al., J. Immunol. Meth., 102:259-274 (1987); Schoofs et al., J. Immunol., 140:611-616 (1988), 
Cwirla, S. E. et al. (1990) Proc. Natl. Acad. Sci. USA, 87:6378; Lowman, H.B. et al. (1991) Biochemistry, 
30:10832; Clackson, T. et al. (1991) Nature, 352: 624; Marks, J. D. et al. (1991), J. Mol. Biol., 222:581; 
Kang, A.S. et al. (1991) Proc. Natl. Acad. Sci. USA, 88:8363, and Smith, G. P. (1991) Current Opin. 
Biotechnol., 2:668). 

30 In this regard, bacteriophage (phage) display is one well known technique which allows one to screen 

large oligopeptide libraries to identify member(s) of those libraries which are capable of specifically binding 
to a polypeptide target. Phage display is a technique by which variant polypeptides are displayed as fusion 
proteins to the coat protein on the surface of bacteriophage particles (Scott, J.K. and Smith, G. P. (1990) 
Science 249: 386). The utility of phage display lies in the fact that large libraries of selectively randomized 

3 5 protein variants (or randomly cloned cDNAs) can be rapidly and efficiently sorted for those sequences that bind 
to a target molecule with high affinity. Display of peptide (Cwirla, S. E. et al. (1990) Proc. Natl. Acad. Sci. 
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USA, 87:6378) or protein (Lowman, H.B. et al. (1991) Biochemistry, 30:10832; Clackson, T. et al. (1991) 
Nature, 352: 624; Marks, J. D. et al. (1991), J. Mol. Biol., 222:581; Kang, A.S. et al. (1991) Proc. Natl. 
Acad. Sci. USA, 88:8363) libraries on phage have been used for screening millions of polypeptides or 
oligopeptides for ones with specific binding properties (Smith, G. P. (1991) Current Opin. Biotechnol. , 2:668). 
Sorting phage libraries of random mutants requires a strategy for constructing and propagating a large number 
of variants, a procedure for affinity purification using the target receptor, and a means of evaluating the results 
of binding enrichments. U.S. Patent Nos. 5,223,409, 5,403,484, 5,571,689, and 5,663,143. 

Although most phage display methods have used filamentous phage, lambdoid phage display systems 
(WO 95/34683; U.S. 5,627,024), T4 phage display systems (Ren, Z-J. et al. (1998) Gene 215:439; Zhu, Z. 
(1997) CAN 33:534; Jiang, J. et al. (1997) can 128:44380; Ren, Z-J. et al. (1997) CAN 127:215644; Ren, Z-J. 
(1996) Protein Sci. 5:1833; Efimov, V. P. et al. (1995) Virus Genes 10:173) and T7 phage display systems 
(Smith, G. P. and Scott, J.K. (1993) Methods in Enzymology, 217, 228-257; U.S. 5,766,905) are also known. 

Many other improvements and variations of the basic phage display concept have now been developed. 
These improvements enhance the ability of display systems to screen peptide libraries for binding to selected 
target molecules and to display functional proteins with the potential of screening these proteins for desired 
properties. Combinatorial reaction devices for phage display reactions have been developed (WO 98/14277) 
and phage display libraries have been used to analyze and control bimolecular interactions (WO 98/20169; WO 
98/20159) and properties of constrained helical peptides (WO 98/20036). WO 97/35196 describes a method 
of isolating an affinity ligand in which a phage display library is contacted with one solution in which the ligand 
will bind to a target molecule and a second solution in which the affinity ligand will not bind to the target 
molecule, to selectively isolate binding ligands. WO 97/46251 describes a method of biopanning a random 
phage display library with an affinity purified antibody and then isolating binding phage, followed by a 
micropanning process using microplate wells to isolate high affinity binding phage. The use ofStaphtylococcus 
aureus protein A as an affinity tag has also been reported (Li et al. (1998) Mol Biotech., 9:187). WO 97/47314 
describes the use of substrate subtraction libraries to distinguish enzyme specificities using a combinatorial 
library which may be a phage display library. A method for selecting enzymes suitable for use in detergents 
using phage display is described in WO 97/09446. Additional methods of selecting specific binding proteins 
are described in U.S. Patent Nos. 5,498,538, 5,432,018, and WO 98/15833. 

Methods of generating peptide libraries and screening these libraries are also disclosed in U.S. Patent 
Nos. 5,723,286, 5,432,018, 5,580,717, 5,427,908, 5,498,530, 5,770,434, 5,734,018, 5,698,426, 5,763,192, 
and 5,723,323. 

C. TAT Bindine Organic Molecules 

TAT binding organic molecules are organic molecules other than oligopeptides or antibodies as defined 
herein that bind, preferably specifically, to a TAT polypeptide as described herein. TAT binding organic 
molecules may be identified and chemically synthesized using known methodology (see, e.g. , PCT Publication 
Nos. WO00/00823 and WO00/39585). TAT binding organic molecules are usually less than about 2000 daltons 
in size, alternatively less than about 1500, 750, 500, 250 or 200 daltons in size, wherein such organic molecules 
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that are capable of binding, preferably specifically, to a TAT polypeptide as described herein may be identified 
without undue experimentation using well known techniques. In this regard, it is noted that techniques for 
screening organic molecule libraries for molecules that are capable of binding to a polypeptide target are well 
known in the art (see, e.g., PCT Publication Nos. WO00/00823 and WO00/39585). TAT binding organic 
molecules may be, for example, aldehydes, ketones, oximes, hydrazones, semicarbazones, carbazides, primary 
5 amines, secondary amines, tertiary amines, N-substituted hydrazines, hydrazides, alcohols, ethers, thiols, 
thioethers, disulfides, carboxylic acids, esters, amides, ureas, carbamates, carbonates, ketals, thioketals, acetals, 
thioacetals, aryl halides, aryl sulfonates, alkyl halides, alkyl sulfonates, aromatic compounds, heterocyclic 
compounds, anilines, alkenes, alkynes, diols, amino alcohols, oxazolidines, oxazolines, thiazolidines, 
thiazolines, enamines, sulfonamides, epoxides, aziridines, isocyanates, sulfonyl chlorides, diazo compounds, 

10 acid chlorides, or the like. 

D. Screening for Anti-TAT Antibodies. TAT Binding Oligopeptides and TAT Binding Organic 

Molecules With the Desired Properties 
Techniques for generating antibodies, oligopeptides and organic molecules that bind to TAT 
polypeptides have been described above. One may further select antibodies, oligopeptides or other organic 

1 5 molecules with certain biological characteristics, as desired. 

The growth inhibitory effects of an anti-TAT antibody, oligopeptide or other organic molecule of the 
invention may be assessed by methods known in the art, e.g., using cells which express a TAT polypeptide 
either endogenously or following transfection with the TAT gene. For example, appropriate tumor cell lines 
and TAT-transfected cells may treated with an anti-TAT monoclonal antibody, oligopeptide or other organic 

20 molecule of the invention at various concentrations for a few days (e.g. , 2-7) days and stained with crystal violet 
or MTT or analyzed by some other colorimetric assay. Another method of measuring proliferation would be 
by comparing 3 H-thymidine uptake by the cells treated in the presence or absence an anti-TAT antibody, TAT 
binding oligopeptide or TAT binding organic molecule of the invention. After treatment, the cells are harvested 
and the amount of radioactivity incorporated into the DNA quantitated in a scintillation counter. Appropriate 

25 positive controls include treatment of a selected cell line with a growth inhibitory antibody known to inhibit 
growth of that cell line. Growth inhibition of tumor cells in vivo can be determined in various ways known in 
the art. Preferably, the tumor cell is one that overexpresses a TAT polypeptide. Preferably, the anti-TAT 
antibody, TAT binding oligopeptide or TAT binding organic molecule will inhibit cell proliferation of a TAT- 
expressing tumor cell in vitro or in vivo by about 25-100% compared to the untreated tumor cell, more 

30 preferably, by about 30-100%, and even more preferably by about 50-100% or 70-100%, in one embodiment, 
at an antibody concentration of about 0.5 to 30 jig/ml. Growth inhibition can be measured at an antibody 
concentration of about 0.5 to 30 jig/ml or about 0.5 nM to 200 nM in cell culture, where the growth inhibition 
is determined 1-10 days after exposure of the tumor cells to the antibody. The antibody is growth inhibitory 
in vivo if administration of the anti-TAT antibody at about 1 ng/kg to about 100 mg/kg body weight results in 

35 reduction in tumor size or reduction of tumor cell proliferation within about 5 days to 3 months from the first 
administration of the antibody, preferably within about 5 to 30 days. 
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To select for an anti-TAT antibody, TAT binding oligopeptide or TAT binding organic molecule which 
induces cell death, loss of membrane integrity as indicated by, e.g. , propidium iodide (PI), trypan blue or 7AAD 
uptake may be assessed relative to control. A PI uptake assay can be performed in the absence of complement 
and immune effector cells. TAT polypeptide-expressing tumor cells are incubated with medium alone or 
medium containing the appropriate anti-TAT antibody (e.g, at about 10ng/ml), TAT binding oligopeptide or 
5 TAT binding organic molecule. The cells are incubated for a 3 day time period. Following each treatment, 
cells are washed and aliquoted into 35 mm strainer-capped 12 x 75 tubes (1ml per tube, 3 tubes per treatment 
group) for removal of cell clumps. Tubes then receive PI (10 ng/ml). Samples may be analyzed using a 
FACSCAN® flow cytometer and FACSCONVERT® CellQuest software (Becton Dickinson). Those anti-TAT 
antibodies, TAT binding oligopeptides or TAT binding organic molecules that induce statistically significant 

1 0 levels of cell death as determined by PI uptake may be selected as cell death-inducing anti-TAT antibodies , TAT 
binding oligopeptides or TAT binding organic molecules. 

To screen for antibodies, oligopeptides or other organic molecules which bind to an epitope on a TAT 
polypeptide bound by an antibody of interest, a routine cross-blocking assay such as that described in 
Antibodies. A Laboratory Manual, Cold Spring Harbor Laboratory, Ed Harlow and David Lane (1988), can 

15 be performed. This assay can be used to determine if a test antibody, oligopeptide or other organic molecule 
binds the same site or epitope as a known anti-TAT antibody. Alternatively, or additionally, epitope mapping 
can be performed by methods known in the art . For example, the antibody sequence can be mutagenized such 
as by alanine scanning, to identify contact residues. The mutant antibody is initailly tested for binding with 
polyclonal antibody to ensure proper folding. In a different method, peptides corresponding to different regions 

20 of a TAT polypeptide can be used in competition assays with the test antibodies or with a test antibody and an 
antibody with a characterized or known epitope. 

E. Antibody Dependent Enzyme Mediated Prodrug Therapy (ADEPT) 

The antibodies of the present invention may also be used in ADEPT by conjugating the antibody to a 
prodrug-activating enzyme which converts a prodrug (e.g., a peptidyl chemotherapeutic agent, see 
25 WO81/01145) to an active anti-cancer drug. See, for example, WO 88/07378 and U.S. Patent No. 4,975,278. 

The enzyme component of the immunoconjugate useful for ADEPT includes any enzyme capable of 
acting on a prodrug in such a way so as to covert it into its more active, cytotoxic form. 

Enzymes that are useful in the method of this invention include, but are not limited to, alkaline 
phosphatase useful for converting phosphate-containing prodrugs into free drugs; arylsulfatase useful for 
30 converting sulfate-containing prodrugs into free drugs; cytosine deaminase useful for converting non-toxic 5- 
fluorocytosine into the anti-cancer drug, 5-fluorouracil; proteases, such as serratia protease, thermolysin, 
subtilisin, carboxypeptidases and cathepsins (such as cathepsins B and L), that are useful for converting peptide- 
containing prodrugs into free drugs; D-alanylcarboxypeptidases, useful for converting prodrugs that contain D- 
amino acid substituents; carbohydrate-cleaving enzymes such as P-galactosidase and neuraminidase useful for 
35 converting glycosylated prodrugs into free drugs; P-lactamase useful for converting drugs derivatized with P- 
lactams into free drugs; and penicillin amidases, such as penicillin V amidase or penicillin G amidase, useful 
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for converting drugs derivatized at their amine nitrogens with phenoxyacetyl or phenylacetyl groups, 
respectively, into free drugs. Alternatively, antibodies with enzymatic activity, also known in the art as 
"abzymes", can be used to convert the prodrugs of the invention into free active drugs (see, e.g., Massey, 
Nature 328:457-458 (1987)). Antibody-abzyme conjugates can be prepared as described herein for delivery of 
the abzyme to a tumor cell population. 
5 The enzymes of this invention can be covalently bound to the anti-TAT antibodies by techniques well 

known in the art such as the use of the heterobifunctional crosslinking reagents discussed above. Alternatively, 
fusion proteins comprising at least the antigen binding region of an antibody of the invention linked to at least 
a functionally active portion of an enzyme of the invention can be constructed using recombinant DNA 
techniques well known in the art (see, e.g., Neuberger et al., Nature 312:604-608 (1984). 
10 F. Full-Length TAT Polypeptides 

The present invention also provides newly identified and isolated nucleotide sequences encoding 
polypeptides referred to in the present application as TAT polypeptides. In particular, cDNAs (partial and full- 
length) encoding various TAT polypeptides have been identified and isolated, as disclosed in further detail in 
the Examples below. 

15 As disclosed in the Examples below, various cDNA clones have been deposited with the ATCC. The 

actual nucleotide sequences of those clones can readily be determined by the skilled artisan by sequencing of 
the deposited clone using routine methods in the art. The predicted amino acid sequence can be determined 
from the nucleotide sequence using routine skill. For the TAT polypeptides and encoding nucleic acids 
described herein, in some cases, Applicants have identified what is believed to be the reading frame best 

20 identifiable with the sequence information available at the time. 

G. Anti-TAT Antibody and TAT Polypeptide Variants 

In addition to the anti-TAT antibodies and full-length native sequence TAT polypeptides described 
herein, it is contemplated that anti-TAT antibody and TAT polypeptide variants can be prepared. Anti-TAT 
antibody and TAT polypeptide variants can be prepared by introducing appropriate nucleotide changes into the 
25 encoding DNA, and/or by synthesis of the desired antibody or polypeptide. Those skilled in the art will 

appreciate that amino acid changes may alter post-translational processes of the anti-TAT antibody or TAT 
polypeptide, such as changing the number or position of glycosylation sites or altering the membrane anchoring 
characteristics. 

Variations in the anti-TAT antibodies and TAT polypeptides described herein, can be made, for 
30 example, using any of the techniques and guidelines for conservative and non-conservative mutations set forth, 
for instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, deletion or insertion of one or 
more codons encoding the antibody or polypeptide that results in a change in the amino acid sequence as 
compared with the native sequence antibody or polypeptide. Optionally the variation is by substitution of at least 
one amino acid with any other amino acid in one or more of the domains of the anti-TAT antibody or TAT 
35 polypeptide. Guidance in determining which amino acid residue may be inserted, substituted or deleted without 
adversely affecting the desired activity may be found by comparing the sequence of the anti-TAT antibody or 
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TAT polypeptide with that of homologous known protein molecules and minimizing the number of amino acid 
sequence changes made in regions of high homology. Amino acid substitutions can be the result of replacing 
one amino acid with another amino acid having similar structural and/or chemical properties, sucli as the 
replacement of a leucine with a serine, i.e., conservative amino acid replacements. Insertions or deletions may 
optionally be in the range of about 1 to 5 amino acids. The variation allowed may be determined by 
systematically making insertions, deletions or substitutions of amino acids in the sequence and testing the 
resulting variants for activity exhibited by the full-length or mature native sequence. 

Anti-TAT antibody and TAT polypeptide fragments are provided herein. Such fragments may be 
truncated at the N-terminus or C -terminus, or may lack internal residues, for example, when compared with 
a full length native antibody or protein. Certain fragments lack amino acid residues that are not essential for 
a desired biological activity of the anti-TAT antibody or TAT polypeptide. 

Anti-TAT antibody and TAT polypeptide fragments may be prepared by any of a number of 
conventional techniques. Desired peptide fragments may be chemically synthesized. An alternative approach 
involves generating antibody or polypeptide fragments by enzymatic digestion, e. g. , by treating the protein with 
an enzyme known to cleave proteins at sites defined by particular amino acid residues, or by digesting the DNA 
with suitable restriction enzymes and isolating the desired fragment. Yet another suitable technique involves 
isolating and amplifying a DNA fragment encoding a desired antibody or polypeptide fragment, by polymerase 
chain reaction (PCR). Oligonucleotides that define the desired termini of the DNA fragment are employed at 
the 5* and 3* primers in the PCR. Preferably, anti-TAT antibody and TAT polypeptide fragments share at least 
one biological and/or immunological activity with the native anti-TAT antibody or TAT polypeptide disclosed 
herein. 

In particular embodiments, conservative substitutions of interest are shown in Table 6 under the heading 
of preferred substitutions. If such substitutions result in a change in biological activity, then more substantial 
changes, denominated exemplary substitutions in Table 6, or as further described below in reference to amino 
acid classes, are introduced and the products screened. 
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Table 6 



Original Exemplary Preferred 
Residue Substitutions Substitutions 

Ala (A) val; leu; ile val 

5 Arg (R) lys; gin; asn lys 

Asn (N) gin; his; lys; arg gin 

Asp (D) glu glu 

Cys (C) ser ser 

Gin (Q) asn asn 

10 Glu (E) asp asp 

Gly (G) pro; ala ala 

His (H) asn; gin; lys; arg arg 

lie (I) leu; val; met; ala; phe; 

norleucine leu 

15 Leu (L) norleucine; ile; val; 

met; ala; phe ile 

Lys (K) arg; gin; asn arg 

Met (M) leu; phe; ile leu 

Phe (F) leu; val; ile; ala; tyr leu 

20 Pro(P) ala ala 

Ser (S) thr thr 

Thr (T) ser ser 

Trp (W) tyr; phe tyr 

Tyr (Y) trp; phe; thr; ser phe 

25 Val (V) ile; leu; met; phe; 

ala; norleucine leu 



Substantial modifications in function or immunological identity of the anti-TAT antibody or TAT 
polypeptide are accomplished by selecting substitutions that differ significantly in their effect on maintaining 
30 (a) the structure of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical 
conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side 
chain. Naturally occurring residues are divided into groups based on common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 

(2) neutral hydrophilic: cys, ser, thr; 
35 (3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

Non-conservative substitutions will entail exchanging a member of one of these classes for another 
40 class. Such substituted residues also may be introduced into the conservative substitution sites or, more 
preferably, into the remaining (non-conserved) sites. 
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The variations can be made using methods known in the art such as oligonucleotide -mediated (site- 
directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter et al. Nucl. 
Acids Res., 13:4331 (1986); Zoller et al., Nucl. Acids Res.. 10:6487 (1987)], cassette mutagenesis [Wells et 
al., Gene. 34:315 (1985)], restriction selection mutagenesis [Wells et al., Philos. Trans. R. Soc. London SerA. 
317:415 (1986)] or other known techniques can be performed on the cloned DNA to produce the anti-TAT 
antibody or TAT polypeptide variant DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a 
contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. 
Such amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino 
acid among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the 
main-chain conformation of the variant [Cunningham and Wells, Science. 244:1081-1085 (1989)]. Alanine is 
also typically preferred because it is the most common amino acid. Further, it is frequently found in both buried 
and exposed positions [Creighton, The Proteins. (W.H. Freeman & Co,, N.Y.); Chothia, J. Mol. Biol.. 150: 1 
(1976)]. If alanine substitution does not yield adequate amounts of variant, an isoteric amino acid can be used. 

Any cysteine residue not involved in maintaining the proper conformation of the anti-TAT antibody 
or TAT polypeptide also may be substituted, generally with serine, to improve the oxidative stability of the 
molecule and prevent aberrant crosslinking. Conversely, cysteine bond(s) may be added to the anti-TAT 
antibody or TAT polypeptide to improve its stability (particularly where the antibody is an antibody fragment 
such as an Fv fragment). 

A particularly preferred type of substitutional variant involves substituting one or more hypervariable 
region residues of a parent antibody (e.g. , a humanized or human antibody). Generally, the resulting variant(s) 
selected for further development will have improved biological properties relative to the parent antibody from 
which they are generated. A convenient way for generating such substitutional variants involves affinity 
maturation using phage display. Briefly, several hypervariable region sites (e.g., 6-7 sites) are mutated to 
generate all possible amino substitutions at each site. The antibody variants thus generated are displayed in a 
monovalent fashion from filamentous phage particles as fusions to the gene III product of M13 packaged within 
each particle. The phage-displayed variants are then screened for their biological activity (e.g., binding affinity) 
as herein disclosed. In order to identify candidate hypervariable region sites for modification, alanine scanning 
mutagenesis can be performed to identify hypervariable region residues contributing significantly to antigen 
binding. Alternatively, or additionally, it may be beneficial to analyze a crystal structure of the antigen-antibody 
complex to identify contact points between the antibody and human TAT polypeptide. Such contact residues 
and neighboring residues are candidates for substitution according to the techniques elaborated herein. Once 
such variants are generated, the panel of variants is subjected to screening as described herein and antibodies 
with superior properties in one or more relevant assays may be selected for further development. 

Nucleic acid molecules encoding amino acid sequence variants of the anti-TAT antibody are prepared 
by a variety of methods known in the art. These methods include, but are not limited to, isolation from a natural 
source (in the case of naturally occurring amino acid sequence variants) or preparation by oligonucleotide- 
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mediated (or site-directed) mutagenesis, PCR mutagenesis, and cassette mutagenesis of an earlier prepared 
variant or a non-variant version of the anti-TAT antibody. 

H. Modifications of Anti-TAT Antibodies and TAT Polypeptides 

Covalent modifications of anti-TAT antibodies and TAT polypeptides are included within the scope of 
this invention. One type of covalent modification includes reacting targeted amino acid residues of an anti-TAT 
5 antibody or TAT polypeptide with an organic derivatizing agent that is capable of reacting with selected side 
chains or the N- or C- terminal residues of the anti-TAT antibody or TAT polypeptide. Derivatization with 
bifunctional agents is useful, for instance, for crosslinking anti-TAT antibody or TAT polypeptide to a water- 
insoluble support matrix or surface for use in the method for purifying anti-TAT antibodies, and vice-versa. 
Commonly used crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N- 

10 hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, homobifiinctional imidoesters, 
including disuccinimidyl esters such as S.S'-dithiobisCsaiccinimidylpropionate), bifunctional maleimides such as 
bis -N-maleimido- 1,8 -octane and agents such as methyl-3-[(p-azidophenyl)ditMo]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding 
glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl 

15 groups of seryl or threonyl residues, methylation of the cc-amino groups of lysine, arginine, and histidine side 
chains [T.E. Creighton, Proteins: Structure and Molecular Properties. W.H. Freeman & Co., San Francisco, 
pp. 79-86 (1983)], acetylation of the N-tenninal amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the anti-TAT antibody or TAT polypeptide included within 
the scope of this invention comprises altering the native glycosylation pattern of the antibody or polypeptide. 

20 "Altering the native glycosylation pattern" is intended for purposes herein to mean deleting one or more 

carbohydrate moieties found in native sequence anti-TAT antibody or TAT polypeptide (either by removing the 
underlying glycosylation site or by deleting the glycosylation by chemical and/or enzymatic means), and/or 
adding one or more glycosylation sites that are not present in the native sequence anti-TAT antibody or TAT 
polypeptide. In addition, the phrase includes qualitative changes in the glycosylation of the native proteins, 

25 involving a change in the nature and proportions of the various carbohydrate moieties present. 

Glycosylation of antibodies and other polypeptides is typically either N-linked or O-linked. N-linked 
refers to the attachment of the carbohydrate moiety to the side chain of an asparagine residue. The tripeptide 
sequences asparagine-X-serine and asparagine-X-threonine, where X is any amino acid except proline, are the 
recognition sequences for enzymatic attachment of the carbohydrate moiety to the asparagine side chain. Thus, 

30 the presence of either of these tripeptide sequences in a polypeptide creates a potential glycosylation site. O- 
linked glycosylation refers to the attachment of one of the sugars N-aceylgalactosamine, galactose, or xylose 
to a hydroxyamino acid, most commonly serine or threonine, although 5-hydroxyproline or 5-hydroxylysine may 
also be used. 



289 



WO 2004/030615 



PCT/US2003/028547 



Addition of glycosylation sites to the anti-TAT antibody or TAT polypeptide is conveniently 
accomplished by altering the amino acid sequence such that it contains one or more of the above-described 
tripeptide sequences (for N-linked glycosylation sites). The alteration may also be made by the addition of, or 
substitution by, one or more serine or threonine residues to the sequence of the original anti-TAT antibody or 
TAT polypeptide (for O-linked glycosylation sites). The anti-TAT antibody or TAT polypeptide amino acid 
5 sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA 

encoding the anti-TAT antibody or TAT polypeptide at preselected bases such that codons are generated that 
will translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the anti-TAT antibody or TAT 
10 polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described 

in the art, e.g., in WO 87/05330 published 11 September 1987, and in Aplin and Wriston, CRC Crit. Rev. 

Biochem.. pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the anti-TAT antibody or TAT polypeptide may be 

accomplished chemically or enzymatically or by mutational substitution of codons encoding for amino acid 
1 5 residues that serve as targets for glycosylation. Chemical deglycosylation techniques are known in the art and 

described, for instance, by Hakimuddin, et ah, Arch. Biochem. Biophvs.. 259:52 (1987) and by Edge et al., 

Anal.Biochem.. 118: 131 (1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved 

by the use of a variety of endo- and exo-glycosidases as described by Thotakura et al., Meth. Enzvmol.. 

138:350 (1987). 

20 Another type of covalent modification of anti-TAT antibody or TAT polypeptide comprises linking the 

antibody or polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), 
polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 
4,301,144; 4,670,417; 4,791,192 or 4,179,337. The antibody or polypeptide also may be entrapped in 
microcapsules prepared, for example, by coacervation techniques or by interfacial polymerization (for example, 

25 hydroxymethylcellulose or gelatin-microcapsules and poly-(methylmethacylate) microcapsules, respectively), 
in colloidal drug delivery systems (for example, liposomes, albumin microspheres, microemulsions, nano- 
particles and nanocapsules), or in macroemulsions. Such techniques are disclosed in Remington's 
Pharmaceutical Sciences. 16th edition, Oslo, A., Ed., (1980). 

The anti-TAT antibody or TAT polypeptide of the present invention may also be modified in a way to 

30 form chimeric molecules comprising an anti-TAT antibody or TAT polypeptide fused to another, heterologous 
polypeptide or amino acid sequence. 

In one embodiment, such a chimeric molecule comprises a fusion of the anti-TAT antibody or TAT 
polypeptide with a tag polypeptide which provides an epitope to which an anti-tag antibody can selectively bind. 
The epitope tag is generally placed at the amino- or carboxyl- terminus of the anti-TAT antibody or TAT 

35 polypeptide. The presence of such epitope-tagged forms of the anti-TAT antibody or TAT polypeptide can be 
detected using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the anti-TAT 
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antibody or TAT polypeptide to be readily purified by affinity purification using an anti-tag antibody or another 
type of affinity matrix that binds to the epitope tag. Various tag polypeptides and their respective antibodies 
are well known in the art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) 
tags; the flu HA tag polypeptide and its antibody 12CA5 [Field et al., Mol. Cell. Biol.. 8:2159-2165 (1988)]; 
the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto [Evan et al. , Molecular and Cellular 
5 Biology. 5:3610-3616 (1985)] ; and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky 
et al. , Protein Engineering. 3(6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide [Hopp et al. , 
BioTechnology. 6:1204-1210 (1988)]; the KT3 epitope peptide [Martin et al., Science. 255:192-194 (1992)]; 
an a-tubulin epitope peptide [Skinner et al., J. Biol. Chem.. 266:15163-15166 (1991)]; and the T7 gene 10 
protein peptide tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. USA. 87:6393-6397 (1990)]. 

10 

In an alternative embodiment, die chimeric molecule may comprise a fusion of the anti-TAT antibody 
or TAT polypeptide with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form 
of the chimeric molecule (also referred to as an "immunoadhesin"), such a fusion could be to the Fc region of 
an IgG molecule. The Ig fusions preferably include the substitution of a soluble (transmembrane domain deleted 

15 or inactivated) form of an anti-TAT antibody or TAT polypeptide in place of at least one variable region within 
an Ig molecule. In a particularly preferred embodiment, the immunoglobulin fusion includes the hinge, CH 2 
and CH 3 , or the hinge, CH,, CH 2 and CH 3 regions of an IgGl molecule. For the production of immunoglobulin 
fusions see also US Patent No. 5,428,130 issued June 27, 1995. 

I. Preparation of Anti-TAT Antibodies and TAT Polypeptides 

20 The description below relates primarily to production of anti-TAT antibodies and TAT polypeptides 

by culturing cells transformed or transfected with a vector containing anti-TAT antibody- and TAT polypeptide- 
encoding nucleic acid. It is, of course, contemplated that alternative methods, which are well known in the art, 
may be employed to prepare anti-TAT antibodies and TAT polypeptides. For instance, the appropriate amino 
acid sequence, or portions thereof, may be produced by direct peptide synthesis using solid-phase techniques 

25 [see, e.g., Stewart et al., Solid-Phase Peptide Synthesis. W.H. Freeman Co., San Francisco, CA (1969); 

Merrifield, J. Am. Chem. Soc. 85:2149-2154 (1963)]. In vitro protein synthesis may be performed using 
manual techniques or by automation. Automated synthesis may be accomplished, for instance, using an Applied 
Biosystems Peptide Synthesizer (Foster City, CA) using manufacturer's instructions. Various portions of the 
anti-TAT antibody or TAT polypeptide may be chemically synthesized separately and combined using chemical 

30 or enzymatic methods to produce the desired anti-TAT antibody or TAT polypeptide. 

1. Isolation of PNA Encoding Anti-TAT Antibody or TAT Polypeptide 
DNA encoding anti-TAT antibody or TAT polypeptide may be obtained from a cDNA library prepared 
from tissue believed to possess the anti-TAT antibody or TAT polypeptide mRNA and to express it at a 
detectable level. Accordingly, human anti-TAT antibody or TAT polypeptide DNA can be conveniently 

35 obtained from a cDNA library prepared from human tissue. The anti-TAT antibody- or TAT polypeptide- 
encoding gene may also be obtained from a genomic library or by known synthetic procedures (e.g. , automated 
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nucleic acid synthesis). 

Libraries can be screened with probes (such as oligonucleotides of at least about 20-80 bases) designed 
to identify the gene of interest or the protein encoded by it. Screening the cDNA or genomic library with the 
selected probe may be conducted using standard procedures, such as described in Sambrook et al., Molecular 
Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989). An alternative means 
5 to isolate the gene encoding anti-TAT antibody or TAT polypeptide is to use PCR methodology [Sambrook et 
al., supra: Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 
1995)]. 

Techniques for screening a cDNA library are well known in tiie art. The oligonucleotide sequences 
selected as probes should be of sufficient length and sufficiently unambiguous that false positives are minimized. 

1 0 The oligonucleotide is preferably labeled such that it can be detected upon hybridization to DNA in the library 
being screened. Methods of labeling are well known in the art, and include the use of radiolabels like 32 P- 
labeled ATP, biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and 
high stringency, are provided in Sambrook et al., supra. 

Sequences identified in such library screening methods can be compared and aligned to other known 

15 sequences deposited and available in public databases such as GenBank or other private sequence databases. 

Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across 
the full-length sequence can be determined using methods known in the art and as described herein. 

Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or genomic 
libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, using 

20 conventional primer extension procedures as described in Sambrook et al., supra, to detect precursors and 
processing intermediates of mRNA that may not have been reverse-transcribed into cDNA. 
2. Selection and Transformation of Host Cells 
Host cells are transfected or transformed with expression or cloning vectors described herein for anti- 
TAT antibody or TAT polypeptide production and cultured in conventional nutrient media modified as 

25 appropriate for inducing promoters, selecting transformants, or amplifying the genes encoding the desired 

sequences. The culture conditions, such as media, temperature, pH and the like, can be selected by the skilled 
artisan without undue experimentation. In general, principles, protocols, and practical techniques for 
maximizing the productivity of cell cultures can be found in Mammalian Cell Biotechnology: a Practical 
Approach. M. Butler, ed. (IRL Press, 1991) and Sambrook et al., supra. 

3 0 Methods of eukaryotic cell transfection and prokary otic cell transformation are known to the ordinarily 

skilled artisan, for example, CaCl 2 , CaP0 4 , liposome-mediated and electroporation. Depending on the host cell 
used, transformation is performed using standard techniques appropriate to such cells. The calcium treatment 
employing calcium chloride, as described in Sambrook et al., supra, or electroporation is generally used for 
prokaryotes. Infection with Agrobacterium tumefaciens is used for transformation of certain plant cells, as 

35 described by Shaw et al., Gene. 23:315 (1983) and WO 89/05859 published 29 June 1989. For mammalian 
cells without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb, Virology. 
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52:456-457 (1978) can be employed. General aspects of mamm alian cell host system transfections have been 
described in U.S. Patent No. 4,399,216. Transformations into yeast are typically carried out according to the 
method of Van Solingen et al., J. Bact.. 130:946 (1977) and Hsiao et ah, Proc. Natl. Acad. Sci. (USA), 
76:3829 (1979). However, other methods for introducing DNA into cells, such as by nuclear microinjection, 
electroporation, bacterial protoplast fusion with intact cells, or polycations, e.g. , polybrene, polyornithine, may 
5 also be used. For various techniques for transforming mammalian cells, see Keown et al., Methods in 
Enzvmology. 185:527-537 (1990) and Mansour et al., Nature. 336:348-352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast, 
or higher eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as Gram-negative 
or Gram-positive organisms, for example, Enterobacteriaceae such as E. coli. Various E. coli strains are 

10 publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); E. coli X1776 (ATCC 31,537); E. coli 
strain W3110 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable prokaryotic host cells include 
Enterobacteriaceae such as Escherichia, e.g., E. coli, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella, 
e.g., Salmonella typhimuriwn, Serratia, e.g., Serratia marcescans, and Shigella, as well as Bacilli such as B. 
subtilis and B. licheniformis (e.g., B. licheniformis 41P disclosed in DD 266,710 published 12 April 1989), 

15 Pseudomonas such as P. aeruginosa, and Streptomyces. These examples are illustrative rather than limiting. 

Strain W3110 is one particularly preferred host or parent host because it is a common host strain for 
recombinant DNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic 
enzymes. For example, strain W3110 may be modified to effect a genetic mutation in the genes encoding 
proteins endogenous to the host, with examples of such hosts including E. coli W3110 strain 1A2, which has 

20 the complete genotype tonA ; E. coli W3110 strain 9E4, which has the complete genotype tonA ptr3; E. coli 
W3 1 10 strain 27C7 (ATCC 55,244), which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169 degP 
ornpTkan r ; E. coli W3110 strain 37D6, which has the complete genotype tonA ptr3 phoA El 5 (argF-lac)169 
degP ompT rbs7 ilvG karf; E. coli W31 10 strain 40B4, which is strain 37D6 with a non-kanamycin resistant 
degP deletion mutation; and an E. coli strain having mutant periplasmic protease disclosed in U.S. Patent No. 

25 4,946,783 issued 7 August 1990. Alternatively, in vitro methods of cloning, e.g., PCR or other nucleic acid 
polymerase reactions, are suitable. 

Full length antibody, antibody fragments, and antibody fusion proteins can be produced in bacteria, 
in particular when glycosylation and Fc effector function are not needed, such as when the therapeutic antibody 
is conjugated to a cytotoxic agent (e.g. , a toxin) and the immunoconjugate by itself shows effectiveness in tumor 

30 cell destruction. Full length antibodies have greater half life in circulation. Production in E. coli is faster and 
more cost efficient. For expression of antibody fragments and polypeptides in bacteria, see, e.g., U.S. 
5,648,237 (Carter et. al.), U.S. 5,789,199 (Joly et al.), and U.S. 5,840,523 (Simmons et al.) which describes 
translation initiation regio (TIR) and signal sequences for optimizing expression and secretion, these patents 
incorporated herein by reference. After expression, the antibody is isolated from the E. coli cell paste in a 

35 soluble fraction and can be purified through, e.g., a protein A or G column depending on the isotype. Final 
purification can be carried out similar to the process for purifying antibody expressed e.g,, in CHO cells. 
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In addition to prokaiyotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning 
or expression hosts for anti-TAT antibody- or TAT polypeptide-encoding vectors. Saccharomyces cerevisiae 
is a commonly used lower eukaryotic host microorganism. Others inchxdeSchizosaccharomyces pombe (Beach 
and Nurse, Nature, 290: 140 [1981]; EP 139,383 published 2 May 1985); Kluyveromyces hosts (U.S. Patent 
No. 4,943,529; Fleer et al., Bio/Technology, 9:968-975 (1991)) such as, e.g., K. loads (MW98-8C, CBS683, 
CBS4574; Louvencourt et al., J. Bacteriol., 154(2):737-742 [1983]), K. fragilis (ATCC 12,424), K. bulgaricus 
(ATCC 16,045), K. wickeramii (ATCC 24,178), K. waltii (ATCC 56,500), K. drosophilarum (ATCC 36,906; 
Van den Berg et al., Bio/Technology, 8:135 (1990)), K. thermotolerans, and K. marxianus; yarrowia (EP 
402,226); Pichiapastoris (EP 183,070; Sreekrishna et al., J. Basic Microbiol. , 28:265-278 [1988]); Candida; 
THcJwderma reesia (EP 244,234); Neurospora crassa (Case et al., Proc. Natl. Acad. Sci. USA. 76:5259-5263 
[1979]); Schwanniomyces such as Schwanniomyces occidentalis (EP 394,538 published 31 October 1990); and 
filamentous fungi such as, e.g., Neurospora, Penicillium, Tolypocladiwn (WO 91/00357 published 10 January 
1991), and Aspergillus hosts such as A, mdiilans (Ballance et al. , Biochem. Bioohvs. Res. Commun.. 1 12:284- 
289 [1983]; Tilburnet al., Gene, 26:205-221 [1983]; Yeltonet al., Proc. Natl. Acad. Sci. USA. 81: 1470-1474 
[1984]) and^. niger (Kelly and Hynes, EMBOJ.. 4:475-479 [1985]). Methylotropic yeasts are suitable herein 
and include, but are not limited to, yeast capable of growth on methanol selected from the genera consisting of 
Hansenula, Candida, Kloeckera, Pichia, Saccharomyces, Torulopsis,andRhodotorula. A list of specific species 
that are exemplary of this class of yeasts may be found in C. Anthony, The Biochemistry of Methvlotrophs. 269 
(1982). 

Suitable host cells for the expression of glycosylated anti-TAT antibody or TAT polypeptide are derived 
from multicellular organisms. Examples of invertebrate cells include insect cells such as Drosophila S2 and 
Spodoptera Sf9, as well as plant cells, such as cell cultures of cotton, corn, potato, soybean, petunia, tomato, 
and tobacco. Numerous baculoviral strains and variants and corresponding permissive insect host cells from 
hosts such as Spodoptera frugiperda (caterpillar), Aedes aegypti (mosquito), Aedes albopictus (mosquito), 
Drosophila melanogaster (fruitfly), and Bombyx mori have been identified. A variety of viral strains for 
transfection are publicly available, e.g., the L-l variant of Autographa californica NPV and the Bm-5 strain of 
Bombyx mori NPV, and such viruses may be used as the virus herein according to the present invention, 
particularly for transfection of Spodoptera frugiperda cells. 

However, interest has been greatest in vertebrate cells, and propagation of vertebrate cells in culture 
(tissue culture) has become a routine procedure. Examples of useful mammalian host cell lines are monkey 
kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line (293 or 293 
cells subcloned for growth in suspension culture, Graham et al., J. Gen Virol. 36:59 (1977)); baby hamster 
kidney cells (BHK, ATCC CCL 10); Chinese hamster ovary cells/-DHFR (CHO, Urlaub et al., Proc. Natl. 
Acad. Sci. USA77:4216 (1980)); mouse Sertoli cells (TM4, Mather, Biol.Reprod. 23:243-251 (1980)); monkey 
kidney cells (CV1 ATCC CCL 70); African green monkey kidney cells (VERO-76, ATCC CRL-1587); human 
cervical carcinoma cells (HELA, ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat 
liver cells (BRL 3A, ATCC CRL 1442); human lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, 
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HB 8065); mouse mammary tumor (MMT 060562, ATCC CCL51); TRI cells (Mather et ah, Annals N.Y. 
Acad. Sci. 383:44-68 (1982)); MRC 5 cells; FS4 cells; and a human hepatoma line (Hep G2). 

Host cells are transformed with the above-described expression or cloning vectors for anti-TAT 
antibody or TAT polypeptide production and cultured in conventional nutrient media modified as appropriate 
for inducing promoters, selecting transformants, or amplifying the genes encoding the desired sequences. 
5 3. Selection and Use of a Replicable Vector 

The nucleic acid (e.g. , cDNA or genomic DNA) encoding anti-TAT antibody or TAT polypeptide may 
be inserted into a replicable vector for cloning (amplification of the DNA) or for expression. Various vectors 
are publicly available. The vector may, for example, be in the form of a plasmid, cosmid, viral particle, or 
phage. The appropriate nucleic acid sequence may be inserted into the vector by a variety of procedures. In 

10 general, DNA is inserted into an appropriate restriction endonuclease site(s) using techniques known in the art. 

Vector components generally include, but are not limited to, one or more of a signal sequence, an origin of 
replication, one or more marker genes, an enhancer element, a promoter, and a transcription termination 
sequence. Construction of suitable vectors containing one or more of these components employs standard 
ligation techniques which are known to the skilled artisan. 

15 The TAT may be produced recombinantly not only directly, but also as a fusion polypeptide with a 

heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site 
at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a component 
of the vector, or it may be a part of the anti-TAT antibody- or TAT polypeptide-encoding DNA that is inserted 
into the vector. The signal sequence may be a prokaryotic signal sequence selected, for example, from the 

20 group of the alkaline phosphatase, penicillinase, lpp, or heat-stable enterotoxin II leaders. For yeast secretion 
the signal sequence may be, e.g., the yeast invertase leader, alpha factor leader (including Saccharomyces and 
Kluyveromyces a-factor leaders, the latter described in U.S. Patent No. 5,010, 182), or acid phosphatase leader, 
the C. albicans glucoamylase leader (EP 362,179 published 4 April 1990), or the signal described in WO 
90/13646 published 15 November 1990. In mammalian cell expression, mammalian signal sequences may be 

25 used to direct secretion of the protein, such as signal sequences from secreted polypeptides of the same or 
related species, as well as viral secretory leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate 
in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. 
The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2\i plasmid 

30 origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for 
cloning vectors in mammalian cells. 

Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker. 
Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g. , ampicillin, 
neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical 

35 nutrients not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli. 

An example of suitable selectable markers for mammalian cells are those that enable the identification 
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of cells competent to take up the anti-TAT antibody- or TAT polypeptide-encoding nucleic acid, such as DHFR 
or thymidine kinase. An appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient 
in DHFR activity, prepared and propagated as described by Urlaub et al. , Proc. Natl. Acad. Sci. USA. 77:4216 
(1980). A suitable selection gene for use in yeast is the trpl gene present in the yeast plasmid YRp7 
[Stinchcomb et al., Nature, 282:39 (1979); Kingsman et al., Gene, 7:141 (1979); Tschemper et al., Gene. 
5 10: 157 (1980)] . The trpl gene provides a selection marker for a mutant strain of yeast lacking the ability to 
grow in tryptophan, for example, ATCC No. 44076 or PEP4-1 [Jones, Genetics. 85:12 (1977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the anti-TAT antibody- 
or TAT polypeptide-encoding nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a 
variety of potential host cells are well known. Promoters suitable for use with prokaryotic hosts include th<p- 

10 lactamase and lactose promoter systems [Chang etal., Nature. 275:615 (1978); Goeddeletal., Nature. 281:544 
(1979)], alkaline phosphatase, a tryptophan (trp) promoter system [Goeddel. Nucleic Acids Res. . 8:4057 (1980); 
EP 36,776], and hybrid promoters such as the tac promoter [deBoer et al., Proc. Nad. Acad. Sci. USA. 80:21- 
25 (1983)] . Promoters for use in bacterial systems also will contain a Shine-Dalgarno (S. D.) sequence operably 
linked to the DNA encoding anti-TAT antibody or TAT polypeptide. 

15 Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 

phosphoglycerate kinase [Hitzeman et al., J. Biol. Chem.. 255:2073 (1980)] or other glycolytic enzymes [Hess 
et J- Adv. Enzyme Rep., 7:149 (1968); Holland, Biochemistry. 17:4900 (1978)], such as enolase, 
glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose- 
6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 

20 isomerase, and glucokinase. 

Other yeast promoters, which are inducible promoters having the additional advantage of transcription 
controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid 
phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde-3- 
phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and 

25 promoters for use in yeast expression are further described in EP 73,657. 

Anti-TAT antibody or TAT polypeptide transcription from vectors in mammalian host cells is 
controlled, for example, by promoters obtained from the genomes of viruses such as polyoma virus, fowlpox 
virus (UK 2,211,504 published 5 July 1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian 
sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous 

30 mammalian promoters, e.g., the actin promoter or an immunoglobulin promoter, and from heat-shock 
promoters, provided such promoters are compatible with the host cell systems. 

Transcription of a DNA encoding the anti-TAT antibody or TAT polypeptide by higher eukaryotes may 
be increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, 
usually about from 10 to 300 bp, mat act on a promoter to increase its transcription. Many enhancer sequences 

35 are now known from mammalian genes (globin, elastase, albumin, a-fetoprotein, and insulin). Typically, 
however, one will use an enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the 
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late side of the replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma 
enhancer on the late side of the replication origin, and adenovirus enhancers. The enhancer may be spliced into 
the vector at a position 5' or 3 1 to the anti-TAT antibody or TAT polypeptide coding sequence, but is preferably 
located at a site 5 1 from the promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or 
5 nucleated cells from .other multicellular organisms) will also contain sequences necessary for the termination 
of transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5' and, 
occasionally 3 1 , untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide 
segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding anti-TAT 
antibody or TAT polypeptide. 

1 0 Still other methods, vectors, and host cells suitable for adaptation to the synthesis of anti-TAT antibody 

or TAT polypeptide in recombinant vertebrate cell culture are described in Gething et al. , Nature. 293:620-625 
(1981); Mantei et al., Nature. 281:40-46 (1979); EP 117,060; and EP 117,058. 
4. Culturing the Host Cells 
The host cells used to produce the anti-TAT antibody or TAT polypeptide of this invention may be 

15 cultured in a variety of media. Commercially available media such as Ham's F10 (Sigma), Minimal Essential 
Medium ((MEM), (Sigma), RPMI-1640 (Sigma), and Dulbecco's Modified Eagle's Medium ((DMEM), Sigma) 
are suitable for culturing the host cells. In addition, any of the media described in Ham et al Meth. Enz. 58:44 
(1979), Barnes et al., Anal. Biochem. 1 02:255 (1980), U.S. Pat. Nos. 4,767,704; 4,657,866; 4,927,762; 
4,560,655; or 5,122,469; WO 90/03430; WO 87/00195; or U.S. Patent Re. 30,985 may be used as culture 

20 media for the host cells. Any of these media may be supplemented as necessary with hormones and/or other 
growth factors (such as insulin, transferrin, or epidermal growth factor), salts (such as sodium chloride, 
calcium, magnesium, and phosphate), buffers (such as HEPES), nucleotides (such as adenosine and thymidine), 
antibiotics (such as GENTAMYCEN™ drug), trace elements (defined as inorganic compounds usually present 
at final concentrations in the micromolar range), and glucose or an equivalent energy source. Any other 

25 necessary supplements may also be included at appropriate concentrations that would be known to those skilled 
in the art. The culture conditions, such as temperature, pH, and the like, are those previously used with the host 
cell selected for expression, and will be apparent to the ordinarily skilled artisan. 
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5. Detecting Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. Natl. 
Acad. Sci. USA , 77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an 
appropriately labeled probe, based on the sequences provided herein. Alternatively, antibodies may be 
5 employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid 
duplexes or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be carried out 
where the duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of 
antibody bound to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 

10 immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate 
directly the expression of gene product. Antibodies useful for immunohistochemical staining and/or assay of 
sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, 
the antibodies may be prepared against a native sequence TAT polypeptide or against a synthetic peptide based 
on the DNA sequences provided herein or against exogenous sequence fused to TAT DNA and encoding a 

15 specific antibody epitope. 

6. Purification of Anti-TAT Antibody and TAT Polypeptide 

Forms of anti-TAT antibody and TAT polypeptide may be recovered from culture medium or from host 
cell lysates. If membrane-bound, it can be released from the membrane using a suitable detergent solution (e.g. 
Triton-X 100) or by enzymatic cleavage. Cells employed in expression of anti-TAT antibody and TAT 

20 polypeptide can be disrupted by various physical or chemical means, such as freeze-thaw cycling, sonication, 
mechanical disruption, or cell lysing agents. 

It may be desired to purify anti-TAT antibody and TAT polypeptide from recombinant cell proteins 
or polypeptides. The following procedures are exemplary of suitable purification procedures: by fractionation 
on an ion-exchange column; ethanol precipitation; reverse phase HPLC; chromatography on silica or on a 

25 cation-exchange resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel 
filtration using, for example, Sephadex G-75; protein A Sepharose columns to remove contaminants such as 
IgG; and metal chelating columns to bind epitope-tagged forms of the anti-TAT antibody and TAT polypeptide. 
Various methods of protein purification may be employed and such methods are known in the art and described 
for example in Deutscher, Methods in Enzymology. 182 (1990); Scopes, Protein Purification: Principles and 

30 Practice. Springer- Verlag, New York (1982). The purification step(s) selected will depend, for example, on 
the nature of the production process used and the particular anti-TAT antibody or TAT polypeptide produced. 

When using recombinant techniques, the antibody can be produced intracellularly, in the periplasmic 
space, or directly secreted into the medium. If the antibody is produced intracellularly, as a first step, the 
particulate debris, either host cells or lysed fragments, are removed, for example, by centrifugation or 

35 ultrafiltration. Carter et al., Bio/Technology 10:163-167 (1992) describe a procedure for isolating antibodies 
which are secreted to the periplasmic space of E. colu Briefly, cell paste is thawed in die presence of sodium 
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acetate (pH 3.5), EDTA, and phenylmethylsulfonylfluoride (PMSF) over about 30 min. Cell debris can be 
removed by centrifugation. Where the antibody is secreted into the medium, supernatants from such expression 
systems are generally first concentrated using a commercially available protein concentration filter, for example, 
an Amicon or Millipore Pellicon ultrafiltration unit. A protease inhibitor such as PMSF may be included in any 
of title foregoing steps to inhibit proteolysis and antibiotics may be included to prevent the growth of adventitious 
5 contaminants. 

The antibody composition prepared from the cells can be purified using, for example, hydroxylapatite 
chromatography, gel electrophoresis, dialysis, and affinity chromatography, with affinity chromatography being 
the preferred purification technique. Hie suitability of protein A as an affinity ligand depends on the species 
and isotype of any immunoglobulin Fc domain that is present in the antibody. Protein A can be used to purify 

10 antibodies, that are based on human yl, y2 or y4 heavy chains (Lindmark et al., J. Immunol. Meth. 62:1-13 
(1983)). Protein G is recommended for all mouse isotypes and for human y3 (Guss et al., EMBO J. 
5:15671575 (1986)). The matrix to which the affinity ligand is attached is most often agarose, but other 
matrices are available. Mechanically stable matrices such as controlled pore glass or 
poly(styrenedivinyl)benzene allow for faster flow rates and shorter processing times than can be achieved with 

15 agarose. Where the antibody comprises a Q3 domain, the Bakerbond ABX™resin (J. T. Baker, Phillipsburg, 
NJ) is useful for purification. Other techniques for protein purification such as fractionation on an ion-exchange 
column, ethanol precipitation, Reverse Phase HPLC, chromatography on silica, chromatography on heparin 
SEPHAROSE™ chromatography on an anion or cation exchange resin (such as a polyaspartic acid column), 
chromatofocusing, SDS-PAGE, and ammonium sulfate precipitation are also available depending on the antibody 

20 to be recovered. 

Following any preliminary purification step(s), the mixture comprising the antibody of interest and 
contaminants may be subjected to low pH hydrophobic interaction chromatography using an elution buffer at 
a pH between about 2.5-4.5, preferably performed at low salt concentrations (e.g., from about 0-0.25M salt). 
J. Pharmaceutical Formulations 

25 Therapeutic formulations of the anti-TAT antibodies, TAT binding oligopeptides, TATbinding organic 

molecules and/or TAT polypeptides used in accordance with the present invention are prepared for storage by 
mixing the antibody, polypeptide, oligopeptide or organic molecule having the desired degree of purity with 
optional pharmaceutically acceptable carriers, excipients or stabilizers ( Remington's Pharmaceutical Sciences 
16th edition, Osol, A. Ed. (1980)), in the form of lyophilized formulations or aqueous solutions. Acceptable 

30 carriers, excipients, or stabilizers are nontoxic to recipients at the dosages and concentrations employed, and 
include buffers such as acetate, Tris, phosphate, citrate, and other organic acids; antioxidants including ascorbic 
acid and methionine; preservatives (such as octadecyldimethylbenzyl ammonium chloride; hexamethonium 
chloride; benzalkonium chloride, benzethonium chloride; phenol, butyl or benzyl alcohol; alkyl parabens such 
as methyl or propyl paraben; catechol; resorcinol; cyclohexanol; 3-pentanol; and m-cresol); low molecular 

35 weight (less than about 10 residues) polypeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; 
hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, 
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histidine, arginine, or lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, 
mannose, or dextrins; chelating agents such as EDTA; tonicifiers such as trehalose and sodium chloride; sugars 
such as sucrose, mannitol, trehalose or sorbitol; surfactant such as polysorbate; salt-forming counter-ions such 
as sodium; metal complexes (e.g., Zn-protein complexes); and/or non-ionic surfactants such as TWEEN®, 
PLURONICS® or polyethylene glycol (PEG). The antibody preferably comprises the antibody at a 
5 concentration of between 5-200 mg/ml, preferably between 10-100 mg/ml. 

The formulations herein may also contain more than one active compound as necessary for the 
particular indication being treated, preferably those with complementary activities that do not adversely affect 
each other. For example, in addition to an anti-TAT antibody, TAT binding oligopeptide, or TAT binding 
organic molecule, it may be desirable to include in the one formulation, an additional antibody, e.g., a second 

1 0 anti-TAT antibody which binds a different epitope on the TAT polypeptide, or an antibody to some other target 
such as a growth factor that affects the growth of the particular cancer. Alternatively, or additionally, the 
composition may further comprise a chemotherapeutic agent, cytotoxic agent, cytokine, growth inhibitory agent, 
anti-hormonal agent, and/or cardioprotectant. Such molecules are suitably present in combination in amounts 
that are effective for the purpose intended. 

1 5 The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation 

techniques or by interfacial polymerization, for example, hydroxymethylcellulose or gelatin-microcapsules and 
poly-(methylmethacylate) microcapsules, respectively, in colloidal drug delivery systems (for example, 
liposomes, albumin microspheres, microemulsions, nano-particles and nanocapsules) or in macroemulsions. 
Such techniques are disclosed in Remington's Pharmaceutical Sciences, 16th edition, Osol, A. Ed. (1980). 

20 Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations 

include semi-permeable matrices of solid hydrophobic polymers containing the antibody, which matrices are 
in the form of shaped articles, e.g., films, or microcapsules. Examples of sustained-release matrices include 
polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides 
(U.S. Pat. No. 3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable ethylene- 

25 vinyl acetate, degradable lactic acid-glycolic acid copolymers such as the LUPRON DEPOT® (injectable 

microspheres composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3- 
hydroxybutyric acid. 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished 
by filtration through sterile filtration membranes. 
30 K. Diagnosis and Treatment with Anti-TAT Antibodies, TAT Binding Oligopeptides and TAT 

Binding Organic Molecules 
To determine TAT expression in the cancer, various diagnostic assays are available. In one 
embodiment, TAT polypeptide overexpression may be analyzed by immunohistochemistry (IHC). Parrafin 
embedded tissue sections from a tumor biopsy may be subjected to the IHC assay and accorded a TAT protein 
35 staining intensity criteria as follows: 
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Score 0 - no staining is observed or membrane staining is observed in less than 10% of tumor cells. 

Score 1 + - a faint/barely perceptible membrane staining is detected in more than 10% of the tumor 
cells. The cells are only stained in part of their membrane. 

Score 2+ - a weak to moderate complete membrane staining is observed in more than 10% of the 
tumor cells. 

. 5 Score 3+ - a moderate to strong complete membrane staining is observed in more than 10% of the 

tumor cells. 

Those tumors with 0 or 1 + scores for TAT polypeptide expression may be characterized as not 
overexpressing TAT, whereas those tumors with 2+ or 3+ scores may be characterized as overexpressing 
TAT. 

10 Alternatively, or additionally, FISH assays such as the INFORM® (sold by Ventana, Arizona) or 

PATHVISION® (Vysis, Illinois) may be carried out on formalin-fixed, paraffin-embedded tumor tissue to 
determine the extent (if any) of TAT overexpression in the tumor. 

TAT overexpression or amplification may be evaluated using an in vivo diagnostic assay, e.g., by 
administering a molecule (such as an antibody, oligopeptide or organic molecule) which binds the molecule to 

15 be detected and is tagged with a detectable label (e.g. , a radioactive isotope or a fluorescent label) and externally 
scanning the patient for localization of the label. 

As described above, the anti-TAT antibodies, oligopeptides and organic molecules of the invention have 
various non-therapeutic applications. The anti-TAT antibodies, oligopeptides and organic molecules of the 
present invention can be useful for diagnosis and staging of TAT polypeptide-expressing cancers (e.g., in 

20 radioimaging). The antibodies, oligopeptides and organic molecules are also useful for purification or 

immunoprecipitation of TAT polypeptide from cells, for detection and quantitation of TAT polypeptide in vitro , 
e.g. , in an ELISA or a Western blot, to kill and eliminate TAT-expressing cells from a population of mixed cells 
as a step in the purification of other cells. 

Currently, depending on the stage of the cancer, cancer treatment involves one or a combination of the 

25 following therapies: surgery to remove the cancerous tissue, radiation therapy, and chemotherapy. Anti-TAT 
antibody, oligopeptide or organic molecule therapy may be especially desirable in elderly patients who do not 
tolerate the toxicity and side effects of chemotherapy well and in metastatic disease where radiation therapy has 
limited usefulness. The tumor targeting anti-TAT antibodies, oligopeptides and organic molecules of the 
invention are useful to alleviate TAT-expressing cancers upon initial diagnosis of the disease or during relapse. 

30 For therapeutic applications, the anti-TAT antibody, oligopeptide or organic molecule can be used alone, or in 
combination therapy with, e.g., hormones, antiangiogens, or radiolabeled compounds, or with surgery, 
cryotherapy, and/or radiotherapy. Anti-TAT antibody, oligopeptide or organic molecule treatment can be 
administered in conjunction with other forms of conventional therapy, either consecutively with, pre- or post- 
conventional therapy. Chemotherapeutic drugs such as TAXOTERE® (docetaxel), TAXOL® (palictaxel), 

35 estramustine and mitoxantrone are used in treating cancer, in particular, in good risk patients. In the present 
method of the invention for treating or alleviating cancer, the cancer patient can be administered anti-TAT 
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antibody, oligopeptide or organic molecule in conjuction with treatment with the one or more of the preceding 
chemotherapeutic agents. In particular, combination therapy with palictaxel and modified derivatives (see, e.g., 
EP0600517) is contemplated. The anti-TAT antibody, oligopeptide or organic molecule will be administered 
with a therapeutically effective dose of the chemotherapeutic agent. In another embodiment, the anti-TAT 
antibody, oligopeptide or organic molecule is administered in conjunction with chemotherapy to enhance the 
5 activity and efficacy of the chemotherapeutic agent, e.g., paclitaxel. The Physicians' Desk Reference (PDR) 
discloses dosages of these agents that have been used in treatment of various cancers. The dosing regimen and 
dosages of these aforementioned chemotherapeutic drugs that are therapeutically effective will depend on the 
particular cancer being treated, the extent of the disease and other factors familiar to the physician of skill in 
the art and can be determined by the physician. 

10 In one particular embodiment, a conjugate comprising an anti-TAT antibody, oligopeptide or organic 

molecule conjugated with a cytotoxic agent is administered to the patient. Preferably, the immunoconjugate 
bound to the TAT protein is internalized by the cell, resulting in increased therapeutic efficacy of the 
immunoconjugate in killing the cancer cell to which it binds. In a preferred embodiment, the cytotoxic agent 
targets or interferes with the nucleic acid in the cancer cell. Examples of such cytotoxic agents are described 

15 above and include maytansinoids, calicheamicins, ribonucleases and DNA endonucleases. 

The anti-TAT antibodies, oligopeptides, organic molecules or toxin conjugates thereof are administered 
to a human patient, in accord with known methods, such as intravenous administration, e.g.,, as a bolus or by 
continuous infusion over a period of time, by intramuscular, intraperitoneal, intracerobrospinal, subcutaneous, 
intra-articular, intrasynovial, intrathecal, oral, topical, or inhalation routes. Intravenous or subcutaneous 

20 administration of the antibody, oligopeptide or organic molecule is preferred. 

Other therapeutic regimens may be combined with the administration of the anti-TAT antibody, 
oligopeptide or organic molecule. The combined administration includes co-administration, using separate 
formulations or a single pharmaceutical formulation, and consecutive administration in either order, wherein 
preferably there is a time period while both (or all) active agents simultaneously exert their biological activities. 

25 Preferably such combined therapy results in a synergistic therapeutic effect. 

It may also be desirable to combine administration of the anti-TAT antibody or antibodies, oligopeptides 
or organic molecules, with administration of an antibody directed against another tumor antigen associated with 
the particular cancer. 

In another embodiment, the therapeutic treatment methods of the present invention involves the 
30 combined administration of an anti-TAT antibody (or antibodies), oligopeptides or organic molecules and one 
or more chemotherapeutic agents or growth inhibitory agents, including co-administration of cocktails of 
different chemotherapeutic agents. Chemotherapeutic agents include estramustine phosphate, prednimustine, 
cisplatin, 5-fluorouracil, melphalan, cyclophosphamide, hydroxyurea and hydroxyureataxanes (such as paclitaxel 
and doxetaxel) and/or anthracycline antibiotics. Preparation and dosing schedules for such chemotherapeutic 
35 agents may be used according to manufacturers' instructions or as determined empirically by the skilled 
practitioner. Preparation and dosing schedules for such chemotherapy are also described in Chemotherapy 
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Service Ed., M.C. Perry, Williams & Wilkins, Baltimore, MD (1992). 

The antibody, oligopeptide or organic molecule may be combined with an anti-hormonal compound; 
e.g., an anti-estrogen compound such as tamoxifen; an anti-progesterone such as onapristone (see, EP 616 812); 
or an anti-androgen such as flutamide, in dosages known for such molecules. Where the cancer to be treated 
is androgen independent cancer, the patient may previously have been subjected to anti-androgen therapy and, 
5 after the cancer becomes androgen independent, the anti-TAT antibody, oligopeptide or organic molecule (and 
optionally other agents as described herein) may be administered to the patient. 

Sometimes, it may be beneficial to also co-administer a cardioprotectant (to prevent or reduce 
myocardial dysfunction associated with the therapy) or one or more cytokines to the patient. In addition to the 
above therapeutic regimes, the patient may be subjected to surgical removal of cancer cells and/or radiation 
10 therapy, before, simultaneously with, or post antibody, oligopeptide or organic molecule therapy. Suitable 
dosages for any of the above co-administered agents are those presently used and may be lowered due to the 
combined action (synergy) of the agent and anti-TAT antibody, oligopeptide or organic molecule. 

For the prevention or treatment of disease, the dosage and mode of administration will be chosen by 
the physician according to known criteria. The appropriate dosage of antibody, oligopeptide or organic molecule 
1 5 will depend on the type of disease to be treated, as defined above, the severity and course of the disease, 

whether the antibody, oligopeptide or organic molecule is administered for preventive or therapeutic purposes, 
previous therapy, the patient's clinical history and response to the antibody, oligopeptide or organic molecule, 

and the discretion of the attending physician. The antibody, oligopeptide or organic molecule is suitably 

i 

administered to the patient at one time or over a series of treatments. Preferably, the antibody, oligopeptide 
20 or organic molecule is administered by intravenous infusion or by subcutaneous injections. Depending on the 
type and severity of the disease, about 1 ng/kg to about 50 mg/kg body weight (e.g., about 0.1-15mg/kg/dose) 
of antibody can be an initial candidate dosage for administration to the patient, whether, for example, by one 
or more separate administrations, or by continuous infusion. A dosing regimen can comprise administering an 
initial loading dose of about 4 mg/kg, followed by a weekly maintenance dose of about 2 mg/kg of the anti-TAT 
25 antibody. However, other dosage regimens may be useful. A typical daily dosage might range from about 1 
pg/kg to 100 mg/kg or more, depending on the factors mentioned above. For repeated administrations over 
several days or longer, depending on the condition, the treatment is sustained until a desired suppression of 
disease symptoms occurs. The progress of this therapy can be readily monitored by conventional methods and 
assays and based on criteria known to the physician or other persons of skill in the art. 
30 Aside from administration of the antibody protein to the patient, the present application contemplates 

administration of the antibody by gene therapy. Such administration of nucleic acid encoding the antibody is 
encompassed by the expression "administering a therapeutically effective amount of an antibody". See, for 
example, WO96/07321 published March 14, 1996 concerning the use of gene therapy to generate intracellular 
antibodies. 

35 There are two major approaches to getting the nucleic acid (optionally contained in a vector) into the 

patient's cells; in vivo and ex vivo. For in vivo delivery the nucleic acid is injected directly into the patient, 
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usually at the site where the antibody is required. For ex vivo treatment, the patient's cells are removed, the 
nucleic acid is introduced into these isolated cells and the modified cells are administered to the patient either 
directly or, for example, encapsulated within porous membranes which are implanted into the patient (see, e.g. , 
U.S. Patent Nos. 4,892,538 and 5,283, 187). There are a variety of techniques available for introducing nucleic 
acids into viable cells. The techniques vary depending upon whether the nucleic acid is transferred into cultured 
cells in vitro, or in vivo in the cells of the intended host. Techniques suitable for the transfer of nucleic acid 
into mammal ian cells in vitro include the use of liposomes, electroporation, microinjection, cell fusion, DEAE- 
dextran, the calcium phosphate precipitation method, etc. A commonly used vector force vivo delivery of the 
gene is a retroviral vector. 

The currently preferred in vivo nucleic acid transfer techniques include transfection with viral vectors 
(such as adenovirus, Herpes simplex I virus, or adeno-associated virus) and lipid-based systems (useful lipids 
for lipid-mediated transfer of the gene are DOTMA, DOPE and DC-Choi, for example). For review of the 
currently known gene marking and gene therapy protocols see Anderson et al., Science 256:808-813 (1992). 
See also WO 93/25673 and the references cited therein. 

The anti-TAT antibodies of the invention can be in the different forms encompassed by the definition 
of "antibody" herein. Thus, the antibodies include full length or intact antibody, antibody fragments, native 
sequence antibody or amino acid variants, humanized, chimeric or fusion antibodies, immunoconjugates, and 
functional fragments thereof. In fusion antibodies an antibody sequence is fused to a heterologous polypeptide 
sequence. The antibodies can be modified in the Fc region to provide desired effector functions. As discussed 
in more detail in the sections herein, with the appropriate Fc regions, the naked antibody bound on the cell 
surface can induce cytotoxicity, e.g., via antibody-dependent cellular cytotoxicity (ADCC) or by recruiting 
complement in complement dependent cytotoxicity, or some other mechanism. Alternatively, where it is 
desirable to eliminate or reduce effector function, so as to minimize side effects or iherapeutic complications, 
certain other Fc regions may be used. 

In one embodiment, the antibody competes for binding or bind substantially to, the same epitope as the 
antibodies of the invention. Antibodies having the biological characteristics of the present anti-TAT antibodies 
of the invention are also contemplated, specifically including the in vivo tumor targeting and any cell 
proliferation inhibition or cytotoxic characteristics. 

Methods of producing the above antibodies are described in detail herein. 

The present anti-TAT antibodies, oligopeptides and organic molecules are useful for treating a TAT- 
expressing cancer or alleviating one or more symptoms of the cancer in a mammal. Such a cancer includes 
prostate cancer, cancer of the urinary tract, lung cancer, breast cancer, colon cancer and ovarian cancer, more 
specifically, prostate adenocarcinoma, renal cell carcinomas, colorectal adenocarcinomas, lung 
adenocarcinomas, lung squamous cell carcinomas, and pleural mesothelioma. The cancers encompass metastatic 
cancers of any of the preceding. The antibody, oligopeptide or organic molecule is able to bind to at least a 
portion of the cancer cells that express TAT polypeptide in the mammal. In a preferred embodiment, the 
antibody, oligopeptide or organic molecule is effective to destroy or kill TAT-expressing tumor cells or inhibit 
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the growth of such tumor cells, in vitro or in vivo, upon binding to TAT polypeptide on the cell. Such an 
antibody includes a naked anti-TAT antibody (not conjugated to any agent). Naked antibodies that have 
cytotoxic or cell growth inhibition properties can be further harnessed with a cytotoxic agent to render them 
even more potent in tumor cell destruction. Cytotoxic properties can be conferred to an anti-TAT antibody by, 
e.g., conjugating the antibody with a cytotoxic agent, to form an immunoconjugate as described herein. The 
5 cytotoxic agent or a growth inhibitory agent is preferably a small molecule. Toxins such as calicheamicin or 
a maytansinoid and analogs or derivatives thereof, are preferable. 

The invention provides a composition comprising an anti-TAT antibody, oligopeptide or organic 
molecule of the invention, and a carrier. For the purposes of treating cancer, compositions can be administered 
to the patient in need of such treatment, wherein the composition can comprise one or more anti-TAT antibodies 

10 present as an immunoconjugate or as the naked antibody. In a further embodiment, the compositions can 

comprise these antibodies, oligopeptides or organic molecules in combination with other therapeutic agents such 
as cytotoxic or growth inhibitory agents, including chemotherapeutic agents. The invention also provides 
formulations comprising an anti-TAT antibody, oligopeptide or organic molecule of the invention, and a carrier. 
In one embodiment, the formulation is a therapeutic formulation comprising a pharmaceutically acceptable 

15 carrier. 

Another aspect of the invention is isolated nucleic acids encoding the anti-TAT antibodies. Nucleic 
acids encoding both the H and L chains and especially the hypervariable region residues, chains which encode 
the native sequence antibody as well as variants, modifications and humanized versions of the antibody, are 
encompassed. 

20 The invention also provides methods useful for treating a TAT polypeptide-expressing cancer or 

alleviating one or more symptoms of the cancer in a mammal, comprising administering a therapeutically 
effective amount of an anti-TAT antibody, oligopeptide or organic molecule to the mammal. The antibody, 
oligopeptide or organic molecule therapeutic compositions can be administered short term (acute) or chronic, 
or intermittent as directed by physician. Also provided are methods of inhibiting the growth of, and killing a 

25 TAT polypeptide-expressing cell. 

The invention also provides kits and articles of manufacture comprising at least one anti-TAT antibody, 
oligopeptide or organic molecule. Kits containing anti-TAT antibodies, oligopeptides or organic molecules find 
use, e.g. , for TAT cell killing assays, for purification or immunoprecipitation of TAT polypeptide from cells. 
For example, for isolation and purification of TAT, the kit can contain an anti-TAT antibody, oligopeptide or 

30 organic molecule coupled to beads (e.g., sepharose beads). Kits can be provided which contain the antibodies, 
oligopeptides or organic molecules for detection and quantitation of TAT in vitro , e.g., in an ELISA or a 
Western blot. Such antibody, oligopeptide or organic molecule useful for detection may be provided with a 
label such as a fluorescent or radiolabel. 

L. Articles of Manufacture and Kits 

35 Another embodiment of the invention is an article of manufacture containing materials useful for the 

treatment of anti-TAT expressing cancer. The article of manufacture comprises a container and a label or 
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package insert on or associated with the container. Suitable containers include, for example, bottles, vials, 
syringes, etc. The containers may be formed from a variety of materials such as glass or plastic. The container 
holds a composition which is effective for treating the cancer condition and may have a sterile access port (for 
example the container may be an intravenous solution bag or a vial having a stopper pierceable by a hypodermic 
injection needle). At least one active agent in the composition is an anti-TAT antibody, oligopeptide or organic 
molecule of the invention. The label or package insert indicates that die composition is used for treating cancer. 
The label or package insert will further comprise instructions for administering the antibody, oligopeptide or 
organic molecule composition to the cancer patient. Additionally, the article of manufacture may further 
comprise a second container comprising a pharmaceuticaUy-acceptable buffer, such as bacteriostatic water for 
injection (BWFI), phosphate-buffered saline, Ringer's solution and dextrose solution. It may further include 
other materials desirable from a commercial and user standpoint, including other buffers, diluents, filters, 
needles, and syringes. 

Kits are also provided that are useful for various purposes , e.g., for TAT-expressing cell killing 
assays, for purification or immunoprecipitation of TAT polypeptide from cells. For isolation and purification 
of TAT polypeptide, the kit can contain an anti-TAT antibody, oligopeptide or organic molecule coupled to 
beads (e.g., sepharose beads). Kits can be provided which contain the antibodies, oligopeptides or organic 
molecules for detection and quantitation of TAT polypeptide in vitro, e.g. , in an ELISA or a Western blot. As 
with the article of manufacture, the kit comprises a container and a label or package insert on or associated with 
the container. The container holds a composition comprising at least one anti-TAT antibody, oligopeptide or 
organic molecule of the invention. Additional containers may be included that contain, e.g., diluents and 
buffers, control antibodies. The label or package insert may provide a description of the composition as well 
as instructions for the intended in vitro or diagnostic use. 

M - Uses for TAT Polypepti des and TAT-Polvpeptide Encoding Nucleic Acids 

Nucleotide sequences (or their complement) encoding TAT polypeptides have various applications in 
the art of molecular biology, including uses as hybridization probes, in chromosome and gene mapping and in 
the generation of anti-sense RNA and DNA probes. TAT-encoding nucleic acid will also be useful for the 
preparation of TAT polypeptides by the recombinant techniques described herein, wherein those TAT 
polypeptides may find use, for example, in the preparation of anti-TAT antibodies as described herein. 

The full-length native sequence TAT gene, or portions thereof, may be used as hybridization probes 
for a cDNA library to isolate the full-length TAT cDNA or to isolate still other cDNAs (for instance, those 
encoding naturally-occurring variants of TAT or TAT from other species) which have a desired sequence 
identity to the native TAT sequence disclosed herein. Optionally, the length of the probes will be about 20 to 
about 50 bases. The hybridization probes may be derived from at least partially novel regions of the full length 
native nucleotide sequence wherein those regions may be determined without undue experimentation or from 
genomic sequences including promoters, enhancer elements and introns of native sequence TAT. By way of 
example, a screening method will comprise isolating the coding region of the TAT gene using the known DNA 
sequence to synthesize a selected probe of about 40 bases. Hybridization probes may be labeled by a variety 
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of labels, including radionucleotides such as P or S, or enzymatic labels such as alkaline phosphatase coupled 
to the probe via avidin/biotin coupling systems. Labeled probes having a sequence complementary to that of 
the TAT gene of the present invention can be used to screen libraries of human cDNA, genomic DNA or mRNA 
to determine which members of such libraries the probe hybridizes to. Hybridization techniques are described 
in further detail in the Examples below. Any EST sequences disclosed in the present application may similarly 
5 be employed as probes, using the methods disclosed herein. 

Other useful fragments of the TAT-encoding nucleic acids include antisense or sense oligonucleotides 
comprising a singe-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target TAT 
mRNA (sense) or TAT DNA (antisense) sequences. Antisense or sense oligonucleotides, according to the 
present invention, comprise a fragment of the coding region of TAT DNA. Such a fragment generally comprises 

10 at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an antisense or 
a sense oligonucleotide, based upon a cDNA sequence encoding a given protein is described in, for example, 
Stein and Cohen ( Cancer Res. 48:2659. 1988) and van der Krol et al. ( BioTechniques 6:958. 1988). 

Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in the formation 
of duplexes that block transcription or translation of the target sequence by one of several means, including 

1 5 enhanced degradation of the duplexes, premature termination of transcription or translation, or by other means. 

Such methods are encompassed by the present invention. The antisense oligonucleotides thus may be used to 
block expression of TAT proteins, wherein those TAT proteins may play a role in the induction of cancer in 
mammals. Antisense or sense oligonucleotides further comprise oligonucleotides having modified sugar- 
phosphodiester backbones (or other sugar linkages, such as those described in WO 91/06629) and wherein such 

20 sugar linkages are resistant to endogenous nucleases. Such oligonucleotides with resistant sugar linkages are 
stable in vivo (i.e., capable of resisting enzymatic degradation) but retain sequence specificity to be able to bind 
to target nucleotide sequences. 

Preferred intragenic sites for antisense binding include the region incorporating the translation 
initiation/start codon (5'-AUG / 5'-ATG) or termination/stop codon (5'-UAA, 5'-UAG and 5-UGA / 5'-TAA, 

25 5'-TAG and 5*-TGA) of the open reading frame (ORF) of the gene. These regions refer to a portion of the 
mRNA or gene that encompasses from about 25 to about 50 contiguous nucleotides in either direction (i.e., 5* 
or 3') from a translation initiation or termination codon. Other preferred regions for antisense binding include: 
introns; exons; intron-exon junctions; the open reading frame (ORF) or "coding region," which is the region 
between the translation initiation codon and tide translation termination codon; the 5' cap of an mRNA which 

30 comprises an N7-methylated guanosine residue joined to the 5' -most residue of the mRNA via a 5* -5' 

triphosphate linkage and includes 5* cap structure itself as well as the first 50 nucleotides adjacent to the cap; 
the 5* untranslated region (5'UTR), the portion of an mRNA in the 5* direction from the translation initiation 
codon, and thus including nucleotides between the 5' cap site and the translation initiation codon of an mRNA 
or corresponding nucleotides on the gene; and the 3' untranslated region (3'UTR), the portion of an mRNA in 

35 the 3' direction from the translation termination codon, and thus including nucleotides between the translation 
termination codon and 3 f end of an mRNA or corresponding nucleotides on the gene. 
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Specific examples of preferred antisense compounds useful for inhibiting expression of TAT proteins 
include oligonucleotides containing modified backbones or non-natural internucleoside linkages. Oligonucleotides 
having modified backbones include those that retain a phosphorus atom in the backbone and those that do not 
have a phosphorus atom in the backbone. For the purposes of this specification, and as sometimes referenced 
in the art, modified oligonucleotides that do not have a phosphorus atom in their internucleoside backbone can 
5 also be considered to be oligonucleosides. Preferred modified oligonucleotide backbones include, for example, 
phosphorothioates, chiralphosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotri-esters, 
methyl and other alkyl phosphonates including 3*-alkylene phosphonates, S'-allsylene phosphonates and chiral 
phosphonates, phosphinates, phosphoramidates including 3*-amino phosphoramidate and 
aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, 

10 selenophosphates and borano-phosphates having normal 3'-5 % linkages, 2 f -5' linked analogs of these, and those 
having inverted polarity wherein one or more internucleotide linkages is a 3' to 3\ 5' to 5* or T to T linkage. 
Preferred oligonucleotides having inverted polarity comprise a single 3' to 3* linkage at the 3* -most 
internucleotide linkage i.e. a single inverted nucleoside residue which may be abasic (the nucleobase is missing 
or has a hydroxyl group in place thereof). Various salts, mixed salts and free acid forms are also included. 

15 Representative United States patents that teach the preparation of phosphorus-containing linkages include, but 
are not limited to, U.S. Pat. Nos.: 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 
5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 
5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; 
5,194,599; 5,565,555; 5,527,899; 5,721,218; 5,672,697 and 5,625,050, each of which is herein incorporated 

20 by reference. 

Preferred modified oligonucleotide backbones that do not include a phosphorus atom therein have 
backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and 
alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic 
internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion 

25 of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl 
backbones; methylene formacetyl and thioformacetyl backbones; riboacetyl backbones; alkene containing 
backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide 
backbones; amide backbones; and others having mixed N, O, S and CH.sub.2 component parts. Representative 
United States patents that teach the preparation of such oligonucleosides include, but are not limited to,. U.S. 

30 Pat. Nos.: 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 

5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 
5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; 
5,792,608; 5,646,269 and 5,677,439, each of which is herein incorporated by reference. 

In other preferred antisense oligonucleotides, both the sugar and the internucleoside linkage, i.e., the 

3 5 backbone, of the nucleotide units are replaced with novel groups . The base units are maintained for hybridization 
with an appropriate nucleic acid target compound. One such oligomeric compound, an oligonucleotide mimetic 
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that has been shown to have excellent hybridization properties, is referred to as a peptide nucleic acid (PNA). 
In PNA compounds, the sugar-backbone of an oligonucleotide is replaced with an amide containing backbone, 
in particular an aminoethylglycine backbone. The nucleobases are retained and are bound directly or indirectly 
to aza nitrogen atoms of the amide portion of the backbone. Representative United States patents that teach the 
preparation of PNA compounds include, but are not limited to, U.S. Pat. Nos.: 5,539,082; 5,714,331; and 
5,719,262, each of which is herein incorporated by reference. Further teaching of PNA compounds can be found 
in Nielsen et al., Science, 1991, 254, 1497-1500. 

Preferred anlisense oligonucleotides incorporate phosphorothioate backbones and/or heteroatom 
backbones, and in particular -CH 2 -NH-0-CH 2 -, -CH 2 -N(CH 3 )-0-CH 2 - [known as a methylene (methylimino) 
or MMI backbone], -CH^O-N^H^-CIV, -CH 2 -N(CH 3 )-N(CH 3 )-CH r and -0-N(CH 3 )-CH 2 -CH 2 - [wherein 
the native phosphodiester backbone is represented as -0-P-0-CH 2 -] described in the above referenced U.S. Pat. 
No. 5,489,677, and the amide backbones of the above referenced U.S. Pat. No. 5,602,240. Also preferred are 
antisense oligonucleotides having morpholino backbone structures of the above-referenced U.S. Pat. No. 
5,034,506. 

Modified oligonucleotides may also contain one or more substituted sugar moieties. Preferred 
oligonucleotides comprise one of the following at the 2' position: OH; F; O-alkyl, S-alkyl, or N-alkyl; O- 
alkenyl, S-alkeynyl, or N-alkenyl; O-alkynyl, S-alkynyl or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, 
alkenyl and alkynyl may be substituted or unsubstituted C t to C I0 alkyl or C 2 to C i0 alkenyl and alkynyl. 
Particularly preferred are 0[(CH 2 ) n O] m CH 3 , OCCH^OCHa, 0(OH^JNH 3 , ©(CH^CH,, OCCH^ONH^ and 
0(CH 2 ) 0 ON[(CH 2 ) n CH 3 )] 2 , where n and m are from 1 to about 10. Other preferred antisense oligonucleotides 
comprise one of the following at the V position: C a to C 10 lower alkyl, substituted lower alkyl, alkenyl, alkynyl, 
alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH 3 , OCN, CI, Br, CN, CF 3 , OCF 3 , SOCH 3 , S0 2 CH 3 , ON0 2 , 
N0 2 , N 3 , NH 2 , heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an 
RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties 
of an oligonucleotide, or a group for improving the pharmacodynamic properties of an oligonucleotide, and 
other substituents having similar properties. A preferred modification includes 2'-methoxyethoxy 
(2'-0-CH 2 CH 2 0CH 3 , also known as 2'-0-(2-methoxyethyl) or2 , -MOE) (Martinet al., Helv. Chim. Acta, 1995, 
78, 486-504) i.e., an alkoxyalkoxy group. A further preferred modification includes 
2'-dimethylaminooxyethoxy, i.e., a 0(CH 2 ) 2 ON(CH3) 2 group, also known as 2 , -DMAOE, as described in 
examples hereinbelow, and 2 , -dimethylaminoethoxyethoxy (also known in the art as 
2 , -0-dimethylaminoethoxyethyl or 2 , -DMAEOE), i.e., 2 , -0-CH 2 -0-CH 2 -N(CH 2 ). 

A further prefered modification includes Locked Nucleic Acids (LNAs) in which the 2'-hydroxyl group 
is linked to the 3' or 4' carbon atom of the sugar ring thereby forming a bicyclic sugar moiety. The linkage is 
preferably a methelyne (-CH 2 -) a group bridging the V oxygen atom and the 4* carbon atom wherein n is 1 or 
2. LNAs and preparation thereof are described in WO 98/39352 and WO 99/14226. 

Other preferred modifications include 2'-methoxy (2 t -0-CH 3 ), 2'-aminopropoxy (2'-OCH 2 CH 2 CH 2 
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NH^, 2'-allyl ^-CH^CH^CH*), 2'-0-aIlyl (2 , -0-CH r CH=CH 2 ) and 2'-fluoro (2'-F). The 2 '-modification 
may be in the arabino (up) position or ribo (down) position. A preferred 2'-arabino modification is 2*-F. Similar 
modifications may also be made at other positions on the oligonucleotide, particularly the 3 ' position of the sugar 
on the 3' terminal nucleotide or in 2'-5' linked oligonucleotides and the 5' position of 5 1 terminal nucleotide. 
Oligonucleotides may also have sugar mimetics such as cyclobutyl moieties in place of the pentofiiranosyl sugar. 
Representative United States patents that teach the preparation of such modified sugar structures include, but 
are not limited to, U.S. Pat. Nos.: 4,981,957; 5,118,800; 5,319,080; 5,359,044; 5,393,878; 5,446,137; 
5,466,786; 5,514,785; 5,519,134; 5,567,811; 5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 
5,639,873; 5,646,265; 5,658,873; 5,670,633; 5,792,747; and 5,700,920, each of which is herein incorporated 
by reference in its entirety. 

Oligonucleotides may also include nucleobase (often referred to in the art simply as "base") 
modifications or substitutions. As used herein, "unmodified" or "natural" nucleobases include the purine bases 
adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified 
nucleobases include other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C), 
5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of 
adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine 
and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl (-C=C-CH 3 or -CH 2 -C =CH) uracil and cytosine and 
other alkynyl derivatives of pyrimidine bases, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 

4- thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and g uanine s, 

5- halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine 
and 7-methyladenine, 2-F-adenine, 2-amino-adenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 
7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Further modified nucleobases include tricyclic 
pyrimidines such as phenoxazine cytidine(lH-pyrimido[5,4-b][l,4]benzoxazin-2(3H)-one), phenothiazine 
cytidine (lH-pyrimido[5,4-b][l ,4]benzothiazin-2(3H)-one), G-clamps such as a substituted phenoxazine cytidine 
(e.g. 9-(2-aminoethoxy) -H-pyrimido[5 , 4-b] [ 1 , 4] benzoxazin-2(3 H) -one) , carbazole cytidine 
(2H-pyrimido[4,5-b]indol-2-one), pyridoindole cytidine (H-pyridoP'^'^^lpyrroloP^-dlpyrimidin^-one). 
Modified nucleobases may also include those in which the purine or pyrimidine base is replaced with other 
heterocycles, for example 7-deaza-adenine, 7-deazaguanosine, 2-aminopyridine and 2-pyridone. Further 
nucleobases include those disclosed in U.S. Pat. No. 3,687,808, those disclosed in The Concise Encyclopedia 
Of Polymer Science And Engineering, pages 858-859, Kroschwitz, J. I., ed. John Wiley & Sons, 1990, and 
those disclosed by Englisch et al. , Angewandte Chemie, International Edition, 1991 , 30, 613. Certain of these 
nucleobases are particularly useful for increasing the binding affinity of the oligomeric compounds of the 
invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and 0-6 substituted purines, 
including 2-aminopropyladenine, 5-propynyluraciland5-propynylcytosine. 5-methylcytosine substitutions have 
been shown to increase nucleic acid duplex stability by 0.6-1. 2.degree. C. (Sanghvi et al, Antisense Research 
and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are preferred base substitutions, even more 
particularly when combined with 2 , -0-methoxyethyl sugar modifications. Representative United States patents 
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that teach the preparation of modified micleobases include, but are not limited to: U.S. Pat. No. 3,687,808, as 
well as U.S. Pat. Nos.: 4,845,205; 5,130,302; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 
5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 5,594,121, 5,596,091; 5,614,617; 
5,645,985; 5,830,653; 5,763,588; 6,005,096; 5,681,941 and 5,750,692, each of which is herein 'incorporated 
by reference. 

5 Another modification of antisense oligonucleotides chemically linking to the oligonucleotide one or 

more moieties or conjugates which enhance the activity, cellular distribution or cellular uptake of the 
oligonucleotide. The compounds of the invention can include conjugate groups covalently bound to functional 
groups such as primary or secondary hydroxyl groups. Conjugate groups of the invention include intercalators, 
reporter molecules, polyamines, polyamides, polyethylene glycols, polyethers, groups that enhance the 

10 pharmacodynamic properties of oligomers, and groups that enhance the pharmacokinetic properties of 

oligomers. Typical conjugates groups include cholesterols, lipids, cation lipids, phospholipids, cationic 
phospholipids, biotin, phenazine, folate, phenanthridine, anthraquinone, acridine, fluoresceins, rhodamines, 
coumarins, and dyes. Groups that enhance the pharmacodynamic properties, in the context of this invention, 
include groups that improve oligomer uptake, enhance oligomer resistance to degradation, and/or strengthen 

1 5 sequence-specific hybridization with RNA. Groups that enhance the pharmacokinetic properties, in the context 
of this invention, include groups that improve oligomer uptake, distribution, metabolism or excretion. Conjugate 
moieties include but are not limited to lipid moieties such as a cholesterol moiety (Letsinger et al., Proc. Natl. 
Acad. Sci. USA, 1989, 86, 6553-6556), cholic acid (Manoharan et al., Bioorg. Med. Chem. Let., 1994, 4, 
1053-1060), a thioether, e.g. , hexyl-S-tritylthiol (Manoharan et al. , Ann. N. Y. Acad. Sci. , 1992, 660, 306-309; 

20 Manoharan et al., Bioorg. Med. Chem. Let., 1993, 3, 2765-2770), a thiocholesterol (Oberhauser et al., Nucl. 

Acids Res., 1992, 20, 533-538), an aliphatic chain, e.g., dodecandiol or undecyl residues (Saison-Behmoaras 
et al., EMBO J., 1991, 10, 1111-1118; Kabanov et al., FEBS Lett., 1990, 259, 327-330; Svinarchuk et al., 
Biochimie, 1993, 75, 49-54), a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethyl-ammonium 
l,2-di-0-hexadecyl-rac-glycero-3-H-phosphonate (Manoharan etal., Tetrahedron Lett., 1995, 36, 3651-3654; 

25 Shea et al., Nucl. Acids Res., 1990, 18, 3777-3783), a polyamine or a polyethylene glycol chain (Manoharan 
et al., Nucleosides & Nucleotides, 1995, 14, 969-973), or adamantane acetic acid (Manoharan et al., 
Tetrahedron Lett., 1995, 36, 3651-3654), a palmityl moiety (Mishra et al., Biochim. Biophys. Acta, 1995, 
1264, 229-237), or an octadecylamine or hexylamino-carbonyl-oxycholesterol moiety. Oligonucleotides of the 
invention may also be conjugated to active drug substances, for example, aspirin, warfarin, phenylbutazone, 

30 ibuprofen, suprofen, fenbufen, ketoprofen, (S)-(4-)-pranoprofen, carprofen, dansylsarcosine, 

2,3,5-triiodobenzoic acid, flufenamic acid, folinic acid, a benzothiadiazide, chlorothiazide, a diazepine, 
indomethicin, a barbiturate, a cephalosporin, a sulfa drug, an antidiabetic, an antibacterial or an antibiotic. 
Oligonucleotide-drug conjugates and their preparation are described in U.S. patent application Ser. No. 
09/334,130 (filed Jun. 15, 1999) and United States patents Nos.: 4,828,979; 4,948,882; 5,218,105; 5,525,465; 

35 5,541,313; 5,545,730; 5,552,538; 5,578,717, 5,580,731; 5,580,731; 5,591,584; 5,109,124; 5,118,802; 

5,138,045; 5,414,077; 5,486,603; 5,512,439; 5,578,718; 5,608,046; 4,587,044; 4,605,735; 4,667,025; 
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4,762,779; 4,789,737; 4,824,941; 4,835,263; 4,876,335; 4,904,582; 4,958,013; 5,082,830; 5,112,963; 
5,214,136; 5,082,830; 5,112,963; 5,214,136; 5,245,022; 5,254,469; 5,258,506; 5,262,536; 5,272,250; 
5,292,873; 5,317,098; 5,371,241, 5,391,723; 5,416,203, 5,451,463; 5,510,475; 5,512,667; 5,514,785; 
5,565,552; 5,567,810; 5,574,142; 5,585,481; 5,587,371; 5,595,726; 5,597,696; 5,599,923; 5,599,928 and 
5,688,941, each of which is herein incorporated by reference. 
5 It is not necessary for all positions in a given compound to be uniformly modified, and in fact more 

than one of the aforementioned modifications may be incorporated in a single compound or even at a single 
nucleoside within an oligonucleotide. The present invention also includes antisense compounds which are 
chimeric compounds. "Chimeric" antisense compounds or "chimeras," in the context of this invention, are 
antisense compounds, particularly oligonucleotides, which contain two or more chemically distinct regions, each 

10 made up of at least one monomer unit, i.e., a nucleotide in the case of an oligonucleotide compound. These 
oligonucleotides typically contain at least one region wherein the oligonucleotide is modified so as to confer upon 
the oligonucleotide increased resistance to nuclease degradation, increased cellular uptake, and/or increased 
binding affinity for the target nucleic acid. An additional region of the oligonucleotide may serve as a substrate 
for enzymes capable of cleaving RNAzDNA or RNA:RNA hybrids. By way of example, RNase H is a cellular 

1 5 endonuclease which cleaves the RNA strand of an RNArDNA duplex. Activation of RNase H, therefore, results 
in cleavage of the RNA target, thereby greatly enhancing the efficiency of oligonucleotide inhibition of gene 
expression. Consequently, comparable results can often be obtained with shorter oligonucleotides when chimeric 
oligonucleotides are used, compared to phosphorothioate deoxyoligonucleotides hybridizing to the same target 
region. Chimeric antisense compounds of the invention may be formed as composite structures of two or more 
. 20 oligonucleotides, modified oligonucleotides, oligonucleosides and/or oligonucleotide mimetics as described 
above. Preferred chimeric antisense oligonucleotides incorporate at least one T modified sugar (preferably 
2 , -0-(CH 2 ) 2 -0-CH 3 ) at the 3' terminal to confer nuclease resistance and a region with at least 4 contiguous 2'-H 
sugars to confer RNase H activity. Such compounds have also been referred to in the art as hybrids or gapmers. 
Preferred gapmers have a region of 2' modified sugars (preferably 2 , -0-(CH 2 ) 2 -0-CH 3 ) at the S'-terminal and 

25 at the 5' terminal separated by at least one region having at least 4 contiguous 2'-H sugars and preferably 

incorporate phosphorothioate backbone linkages. Representative United States patents that teach the preparation 
of such hybrid structures include, but are not limited to, U.S. Pat. Nos. 5,013,830; 5,149,797; 5,220,007; 
5,256,775; 5,366,878; 5,403,711; 5,491,133; 5,565,350; 5,623,065; 5,652,355; 5,652,356; and 5,700,922, 
each of which is herein incorporated by reference in its entirety. 

30 The antisense compounds used in accordance with this invention may be conveniently and routinely 

made through the well-known technique of solid phase synthesis. Equipment for such synthesis is sold by 
several vendors including, for example, Applied Biosystems (Foster City, Calif.). Any other means for such 
synthesis known in the art may additionally or alternatively be employed. It is well known to use similar 
techniques to prepare oligonucleotides such as the phosphorothioates and alkylated derivatives. The compounds 

35 of the invention may also be admixed, encapsulated, conjugated or otherwise associated with other molecules, 
molecule structures or mixtures of compounds, as for example, liposomes, receptor targeted molecules, oral, 
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rectal, topical or other formulations, for assisting in uptake, distribution and/or absorption. Representative 
United States patents that teach the preparation of such uptake, distribution and/or absorption assisting 
formulations include, but are not limited to, U.S. Pat. Nos. 5,108,921; 5,354,844; 5,416,016; 5,459,127; 
5,521,291; 5,543,158; 5,547,932; 5,583,020; 5,591,721; 4,426,330; 4,534,899; 5,013,556; 5,108,921; 
5,213,804; 5,227,170; 5,264,221; 5,356,633; 5,395,619; 5,416,016; 5,417,978; 5,462,854; 5,469,854; 
5,512,295; 5,527,528; 5,534,259; 5,543,152; 5,556,948; 5,580,575; and 5,595,756, each of which is herein 
incorporated by reference. 

Other examples of sense or antisense oligonucleotides include those oligonucleotides which are 
covalently linked to organic moieties, such as those described in WO 90/10048, and other moieties that increases 
affinity of the oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further still, 
intercalating agents, such as ellipticine, and alkylating agents or metal complexes may be attached to sense or 
antisense oligonucleotides to modify binding specificities of the antisense or sense oligonucleotide for the target 
nucleotide sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing the target nucleic acid 
sequence by any gene transfer method, including, for example, CaPO 4 -mediated DNA transfection, 

electroporation, or by using gene transfer vectors such as Epstein-Barr virus. In a preferred procedure, an 
antisense or sense oligonucleotide is inserted into a suitable retroviral vector. A cell containing the target 
nucleic acid sequence is contacted with the recombinant retroviral vector, either in vivo or ex vivo. Suitable 
retroviral vectors include, but are not limited to, those derived from the murine retrovirus M-MuLV, N2 (a 
retrovirus derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and DCT5C (see 
WO 90/13641). 

Sense or antisense oligonucleotides also may be introduced into a cell containing the target nucleotide 
sequence by formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable 
ligand binding molecules include, but are not limited to, cell surface receptors, growth factors, other cytokines, 
or other ligands that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
not substantially interfere with the ability of the ligand binding molecule to bind to its corresponding molecule 
or receptor, or block entry of the sense or antisense oligonucleotide or its conjugated version into the cell. 

Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell containing the target 
nucleic acid sequence by formation of an oligonucleotide-lipid complex, as described in WO 90/10448. The 
sense or antisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous 
lipase. 

Antisense or sense RNA or DNA molecules are generally at least about 5 nucleotides in length, 
alternatively at least about 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 
28, 29, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 
145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 
300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 
510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 
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720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 870, 880, 890, 900, 910, 920, 
930, 940, 950, 960, 970, 980, 990, or 1000 nucleotides in length, wherein in this context the term "about" 
means the referenced nucleotide sequence length plus or minus 10% of that referenced length. 

The probes may also be employed in PCR techniques to generate a pool of sequences for identification 
of closely related TAT coding sequences. 
5 Nucleotide sequences encoding a TAT can also be used to construct hybridization probes for mapping 

the gene which encodes that TAT and for the genetic analysis of individuals with genetic disorders. The 
nucleotide sequences provided herein may be mapped to a chromosome and specific regions of a chromosome 
using known techniques, such as in situ hybridization, linkage analysis against known chromosomal markers, 
and hybridization screening with libraries. 

1 0 When the coding sequences for TAT encode a protein which binds to another protein (example, where 

the TAT is a receptor), the TAT can be used in assays to identify the other proteins or molecules involved in 
the binding interaction. By such methods, inhibitors of the receptor/ligand binding interaction can be identified. 
Proteins involved in such binding interactions can also be used to screen for peptide or small molecule inhibitors 
or agonists of the binding interaction. Also, the receptor TAT can be used to isolate correlative ligand(s). 

1 5 Screening assays can be designed to find lead compounds that mimic the biological activity of a native TAT or 
a receptor for TAT. Such screening assays will include assays amenable to high-throughput screening of 
chemical libraries, making them particularly suitable for identifying small molecule drug candidates. Small 
molecules contemplated include synthetic organic or inorganic compounds. The assays can be performed in a 
variety of formats, including protein-protein binding assays, biochemical screening assays, immunoassays and 

20 cell based assays, which are well characterized in the art. 

Nucleic acids which encode TAT or its modified forms can also be used to generate either transgenic 
animals or "knock out" animals which, in turn, are useful in the development and screening of therapeutically 
useful reagents. A transgenic animal (e.g., a mouse or rat) is an animal having cells that contain a transgene, 
which transgene was introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic 

25 stage. A transgene is a DNA which is integrated into the genome of a cell from which a transgenic animal 
develops. In one embodiment, cDNA encoding TAT can be used to clone genomic DNA encoding TAT in 
accordance with established techniques and the genomic sequences used to generate transgenic animals that 
contain cells which express DNA encoding TAT. Methods for generating transgenic animals, particularly 
animals such as mice or rats, have become conventional in the art and are described, for example, in U.S. 

30 Patent Nos. 4,736,866 and 4,870,009. Typically, particular cells would be targeted for TAT transgene 

incorporation with tissue-specific enhancers. Transgenic animals that include a copy of a transgene encoding 
TAT introduced into the germ line of the animal at an embryonic stage can be used to examine the effect of 
increased expression of DNA encoding TAT. Such animals can be used as tester animals for reagents thought 
to confer protection from, for example, pathological conditions associated with its overexpression. In 

3 5 accordance with this facet of the invention, an animal is treated with the reagent and a reduced incidence of the 
pathological condition, compared to untreated animals bearing the transgene, would indicate a potential 
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therapeutic intervention for the pathological condition. 

Alternatively, non-human homologies of TAT can be used to construct a TAT "knock out" animal 
which has a defective or altered gene encoding TAT as a result of homologous recombination between the 
endogenous gene encoding TAT and altered genomic DNA encoding TAT introduced into an embryonic stem 
cell of the animal. For example, cDNA encoding TAT can be used to clone genomic DNA encoding TAT in 
5 accordance with established techniques. A portion of the genomic DNA encoding TAT can be deleted or 

replaced with another gene, such as a gene encoding a selectable marker which can be used to monitor 
integration. Typically, several kilobases of unaltered flanking DNA (both at the 5 ' and 3* ends) are included 
in the vector [see e.g., Thomas and Capecchi, Cell. 51:503 (1987) for a description of homologous 
recombination vectors]. The vector is introduced into an embryonic stem cell line (e.g., by electroporation) 

10 and cells in which the introduced DNA has homologously recombined with the endogenous DNA are selected 
[see e.g., Li et al., Cell. 69:915 (1992)]. The selected cells are then injected into a blastocyst of an animal 
(e.g., a mouse or rat) to form aggregation chimeras [see e.g., Bradley, in Teratocarcinomas and Embryonic 
Stem Cells: A Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 113-152]. A chimeric embryo 
can then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to term to 

15 create a "knock out" animal. Progeny harboring the homologously recombined DNA in their germ cells can 
be identified by standard techniques and used to breed animals in which all cells of the animal contain the 
homologously recombined DNA. Knockout animals can be characterized for instance, for their ability to defend 
against certain pathological conditions and for their development of pathological conditions due to absence of 
the TAT polypeptide. 

20 Nucleic acid encoding the TAT polypeptides may also be used in gene therapy. In gene therapy 

applications, genes are introduced into cells in order to achieve in vivo synthesis of a therapeutically effective 
genetic product, for example for replacement of a defective gene. "Gene therapy" includes both conventional 
gene therapy where a lasting effect is achieved by a single treatment, and the administration of gene therapeutic 
agents, which involves the one time or repeated administration of a therapeutically effective DNA or mRNA. 

25 Antisense RNAs and DNAs can be used as therapeutic agents for blocking the expression of certain genes in 
vivo. It has already been shown that short antisense oligonucleotides can be imported into cells where they act 
as inhibitors, despite their low intracellular concentrations caused by their restricted uptake by the cell 
membrane. (Zamecnikef a/., Proc. Natl. Acad. Sci. USA 83:4143-4146 T19861). The oligonucleotides can be 
modified to enhance their uptake, e.g. by substituting their negatively charged phosphodiester groups by 

30 uncharged groups. 

There are a variety of techniques available for introducing nucleic acids into viable cells. The 
techniques vary depending upon whether the nucleic acid is transferred into cultured cells in vitro, or in vivo 
in the cells of the intended host. Techniques suitable for the transfer of nucleic acid into mammalian cells in 
vitro include the use of liposomes, electroporation, microinjection, cell fusion, DEAE-dextran, the calcium 

35 phosphate precipitation method, etc. The currently preferred in vivo gene transfer techniques include 
transfection with viral (typically retroviral) vectors and viral coat protein-liposome mediated transfection (Dzau 
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et al-. Trends in Biotechnology 11, 205-210 [1993]). In some situations it is desirable to provide the nucleic 
acid source with an agent that targets the target cells, such as an antibody specific for a cell surface membrane 
protein or the target cell, a ligand for a receptor on the target cell, etc. Where liposomes are employed, proteins 
which bind to a cell surface membrane protein associated with endocytosis may be used for targeting and/or to 
facilitate uptake, e.g. capsid proteins or fragments thereof tropic for a particular cell type, antibodies for 
proteins which undergo internalization in cycling, proteins that target intracellular localization and enhance 
intracellular half-life. The technique of receptor-mediated endocytosis is described, for example, by Wu et al. , 
J.Biol. Chem. 262, 4429-4432 (1987); and Wagner et al., Proc. Natl. Acad. Sci. USA 87. 3410-3414 (1990). 
For review of gene marking and gene therapy protocols see Anderson et al,, Science 256, 808-813 (1992). 

The nucleic acid molecules encoding the TAT polypeptides or fragments thereof described herein are 
useful for chromosome identification. In this regard, there exists an ongoing need to identify new chromosome 
markers, since relatively few chromosome marking reagents, based upon actual sequence data are presently 
available. Each TAT nucleic acid molecule of the present invention can be used as a chromosome marker. 

The TAT polypeptides and nucleic acid molecules of the present invention may also be used 
diagnosticaily for tissue typing, wherein the TAT polypeptides of the present invention may be differentially 
expressed in one tissue as compared to another, preferably in a diseased tissue as compared to a normal tissue 
of the same tissue type. TAT nucleic acid molecules will find use for generating probes for PCR, Northern 
analysis, Southern analysis and Western analysis. 

This invention encompasses methods of screening compounds to identify those that mimic the TAT 
polypeptide (agonists) or prevent the effect of the TAT polypeptide (antagonists). Screening assays for 
antagonist drug candidates are designed to identify compounds that bind or complex with the TAT polypeptides 
encoded by the genes identified herein, or otherwise interfere with the interaction of the encoded polypeptides 
with other cellular proteins, including e.g., inhibiting the expression of TAT polypeptide from cells. Such 
screening assays will include assays amenable to high-throughput screening of chemical libraries, making them 
particularly suitable for identifying small molecule drug candidates. 

The assays can be performed in a variety of formats, including protein-protein binding assays, 
biochemical screening assays, immunoassays, and cell-based assays, which are well characterized in the art. 

All assays for antagonists are common in that they call for contacting the drug candidate with a TAT 
polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these 
two components to interact. 

In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 
reaction mixture. In a particular embodiment, the TAT polypeptide encoded by the gene identified herein or 
the drug candidate is immobilized on a solid phase, e.g., on a microtiter plate, by covalent or non-covalent 
attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution 
of the TAT polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody, 
specific for the TAT polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is 
performed by adding the non-immobilized component, which may be labeled by a detectable label, to the 
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immobilized component, e.g., the coated surface containing the anchored component. When the reaction is 
complete, the non-reacted components are removed, e.g., by washing, and complexes anchored on die solid 
surface are detected. When the originally non-immobilized component carries a detectable label, the detection 
of label immobilized on the surface indicates that complexing occurred. Where the originally non-immobilized 
component does not carry a label, complexing can be detected, for example, by using a labeled antibody 
specifically binding the immobilized complex. 

If the candidate compound interacts with but does not bind to a particular TAT polypeptide encoded 
by a gene identified herein, its interaction with that polypeptide can be assayed by methods well known for 
detecting protein-protein interactions. Such assays include traditional approaches, such as, e.g., cross-linking, 
co-immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition, 
protein-protein interactions can be monitored by using a yeast-based genetic system described by Fields and co- 
workers (Fields and Song, Nature (London). 340:245-246 (1989); Chien et al., Proc. Natl. Acad. Sci. USA, 
88:9578-9582 (1991)) as disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA. 89: 5789-5793 
(1991). Many transcriptional activators, such as yeast GAL4, consist of two physically discrete modular 
domains, one acting as the DNA-binding domain, the other one functioning as the transcription-activation 
domain. The yeast expression system described in the foregoing publications (generally referred to as the "two- 
hybrid system") takes advantage of this property, and employs two hybrid proteins, one in which the target 
protein is fused to the DNA-binding domain of GAL4, and another, in which candidate activating proteins are 
fused to the activation domain. The expression of a GAL1- lacZ reporter gene under control of a GAL4- 
activated promoter depends on reconstitution of GAL4 activity via protein-protein interaction. Colonies 
containing interacting polypeptides are detected with a chromogenic substrate for p-galactosidase. A complete 
kit (MATCHMAKER™) for identifying protein-protein interactions between two specific proteins using the two- 
hybrid technique is commercially available from Clontech. This system can also be extended to map protein 
domains involved in specific protein interactions as well as to pinpoint amino acid residues that are crucial for 
these interactions. 

Compounds that interfere with the interaction of a gene encoding a TAT polypeptide identified herein 
and other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared 
containing the product of the gene and the intra- or extracellular component under conditions and for a time 
allowing for the interaction and binding of the two products. To test the ability of a candidate compound to 
inhibit binding, the reaction is run in the absence and in the presence of the test compound. In addition, a 
placebo may be added to a third reaction mixture, to serve as positive control. The binding (complex formation) 
between the test compound and the intra- or extracellular component present in the mixture is monitored as 
described hereinabove. The formation of a complex in the control reactions) but not in the reaction mixture 
containing the test compound indicates that the test compound interferes with the interaction of the test 
compound and its reaction partner. 

To assay for antagonists, the TAT polypeptide may be added to a cell along with the compound to be 
screened for a particular activity and the ability of the compound to inhibit the activity of interest in the presence 
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of the TAT polypeptide indicates that the compound is an antagonist to the TAT polypeptide. Alternatively, 
antagonists may be detected by combining the TAT polypeptide and a potential antagonist with membrane-bound 
TAT polypeptide receptors or recombinant receptors under appropriate conditions for a competitive inhibition 
assay. The TAT polypeptide can be labeled, such as by radioactivity, such that the number of TAT polypeptide 
molecules bound to the receptor can be used to determine the effectiveness of the potential antagonist. The gene 
5 encoding the receptor can be identified by numerous methods known to those of skill in the art, for example, 
ligand panning and FACS sorting. Coliganetal., Current Protocols in Immun.. 1(2): Chapter 5 (1991). 
Preferably, expression cloning is employed wherein polyadenylated RNA is prepared from a cell responsive to 
die TAT polypeptide and a cDNA library created from this RNA is divided into pools and used to transfect COS 
cells or other cells that are not responsive to the TAT polypeptide. Transfected cells that are grown on glass 

10 slides are exposed to labeled TAT polypeptide. The TAT polypeptide can be labeled by a variety of means 
including iodination or inclusion of a recognition site for a site-specific protein kinase. Following fixation and 
incubation, the slides are subjected to autoradiographic analysis. Positive pools are identified and sub-pools are 
prepared and re-transfected using an interactive sub-pooling and re-screening process, eventually yielding a 
single clone that encodes the putative receptor. 

15 As an alternative approach for receptor identification, labeled TAT polypeptide can be photoaffinity- 

linked with cell membrane or extract preparations that express the receptor molecule. Cross-linked material 
is resolved by PAGE and exposed to X-ray film. The labeled complex containing the receptor can be excised, 
resolved into peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained 
from micro- sequencing would be used to design a set of degenerate oligonucleotide probes to screen a cDNA 

20 library to identify the gene encoding the putative receptor. 

In another assay for antagonists, mammalian cells or a membrane preparation expressing the receptor 
would be incubated with labeled TAT polypeptide in the presence of the candidate compound. The ability of 
the compound to enhance or block this interaction could then be measured. 

More specific examples of potential antagonists include an oligonucleotide that binds to the fusions of 

25 immunoglobulin with TAT polypeptide, and, in particular, antibodies including, without limitation, poly- and 
monoclonal antibodies and antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and chimeric 
or humanized versions of such antibodies or fragments, as well as human antibodies and antibody fragments. 
Alternatively, a potential antagonist may be a closely related protein, for example, a mutated form of the TAT 
polypeptide that recognizes the receptor but imparts no effect, thereby competitively inhibiting the action of the 

30 TAT polypeptide. 

Another potential TAT polypeptide antagonist is an antisense RNA or DNA construct prepared using 
antisense technology, where, e.g., an antisense RNA or DNA molecule acts to block directly the translation of 
mRNA by hybridizing to targeted mRNA and preventing protein translation. Antisense technology can be used 
to control gene expression through triple-helix formation or antisense DNA or RNA, both of which methods 

35 are based on binding of a polynucleotide to DNA or RNA. For example, the 5* coding portion of the 

polynucleotide sequence, which encodes the mature TAT polypeptides herein, is used to design an antisense 
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RNA oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be 
complementary to a region of the gene involved in transcription (triple helix - see Lee et al. , Nucl. Acids Res., 
6:3073 (1979); Cooney et al., Science. 241: 456 (1988); Dervan et al., Science, 251:1360 (1991)), thereby 
preventing transcription and the production of the TAT polypeptide. The antisense RNA oligonucleotide 
hybridizes to the mRNA in vivo and blocks translation of the mRNA molecule into the TAT polypeptide 
5 (antisense - Okano, Neurochem. . 56:560 (1991); Oligodeoxvnucleotides as Antisen se Inhibit ors of Gene 
Expression (CRC Press: Boca Raton, FL, 1988). The oligonucleotides described above can also be delivered 
to cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of the TAT 
polypeptide. When antisense DNA is used, oligodeoxyribonucleotides derived from the translation-initiation 
site, e.g., between about -10 and +10 positions of the target gene nucleotide sequence, are preferred. 
10 Potential antagonists include small molecules that bind to the active site, the receptor binding site, or 

growth factor or other relevant binding site of the TAT polypeptide, thereby blocking the normal biological 
activity of the TAT polypeptide. Examples of small molecules include, but are not limited to, small peptides 
or peptide-like molecules, preferably soluble peptides, and synthetic non-peptidyl organic or inorganic 
compounds. 

15 Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. 

Ribozymes act by sequence-specific hybridization to the complementary target RNA, followed by 
endonucleolytic cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by 
known techniques. For further details see, e.g., Ross Current Biology. 4:469-471 (1994), and PCT publication 
No. WO 97/33551 (published September 18, 1997). 

20 Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single-stranded 

and composed of deoxynucleotides. The base composition of these oligonucleotides is designed such that it 
promotes triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches of 
purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT publication No. WO 
97/33551, supra. 

25 These small molecules can be identified by any one or more of the screening assays discussed 

hereinabove and/or by any other screening techniques well known for those skilled in the art. 

Isolated TAT polypeptide-encoding nucleic acid can be used herein for recombinantly producing TAT 
polypeptide using techniques well known in the art and as described herein. In turn, the produced TAT 
polypeptides can be employed for generating anti-TAT antibodies using techniques well known in the art and 

30 as described herein. 
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Antibodies specifically binding a TAT polypeptide identified herein, as well as other molecules 
identified by the screening assays disclosed hereinbefore, can be administered for the treatment of various 
disorders, including cancer, in the form of pharmaceutical compositions. 

If the TAT polypeptide is intracellular and whole antibodies are used as inhibitors, internalizing 
antibodies are preferred. However, lipofections or liposomes can also be used to deliver the antibody, or an 
antibody fragment, into cells. Where antibody fragments are used, the smallest inhibitory fragment that 
specifically binds to the binding domain of the target protein is preferred. For example, based upon the 
variable-region sequences of an antibody, peptide molecules can be designed that retain the ability to bind the 
target protein sequence. Such peptides can be synthesized chemically and/or produced by recombinant DNA 
technology. See, e.g., Marasco etal., Proc. Natl. Acad. Sci. USA, 90: 7889-7893 (1993). 

The formulation herein may also contain more than one active compound as necessary for the particular 
indication being treated, preferably those with complementary activities that do not adversely affect each other. 
Alternatively, or in addition, the composition may comprise an agent that enhances its function, such as, for 
example, a cytotoxic agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. Such molecules are 
suitably present in combination in amounts that are effective for the purpose intended. 

The following examples are offered for illustrative purposes only, and are not intended to limit the 
scope of the present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by 
reference in their entirety. 

EXAMPLES 

Commercially available reagents referred to in the examples were used according to manufacturer's 
instructions unless otherwise indicated. The source of those cells identified in the following examples, and 
throughout the specification, by ATCC accession numbers is the American Type Culture Collection, Manassas, 
VA. 

EXAMPLE 1: Analysis of Differential TAT Polypeptide Expression bv GEPIS 

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and interesting EST sequences were identified by GEPIS. Gene expression profiling in silico 
(GEPIS) is abioinformatics tool developed at Genentech, Inc. that characterizes genes of interest for new cancer 
therapeutic targets. GEPIS takes advantage of large amounts of EST sequence and library information to 
determine gene expression profiles. GEPIS is capable of determining the expression profile of a gene based 
upon its proportional correlation with the number of its occurrences in EST databases, and it works by 
integrating the LIFESEQ® EST relational database and Genentech proprietary information in a stringent and 
statistically meaningful way. In this example, GEPIS is used to identify and cross-validate novel tumor 
antigens, although GEPIS can be configured to perform either very specific analyses or broad screening tasks. 
For the initial screen, GEPIS is used to identify EST sequences from the LIFESEQ® database that correlate 
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to expression in a particular tissue or tissues of interest (often a tumor tissue of interest). Then, GEPIS was 
employed to generate a complete tissue expression profile for the various sequences of interest. Using this type 
of screening bioinformatics, various TAT polypeptides (jand their encoding nucleic acid molecules) were 
identified as being significantly overexpressed in a particular type of cancer or certain cancers as compared to 
other cancers and/or normal non-cancerous tissues. The rating of GEPIS hits is based upon several criteria 
including, for example, tissue specificity, tumor specificity and expression level in normal essential and/or 
normal proliferating tissues. The following is a list of molecules whose tissue expression profile as determined 
by GEPIS evidences significant upregulation of expression in a specific tumor or tumors as compared to other 
tumor(s) and/or normal tissues and optionally relatively low expression in normal essential and/or normal 
proliferating tissues. 

Under each tissue heading shown below is a list of the cDNA sequences that are detectably 
overexpressed in tumor tissue of the indicated tissue type as compared to normal non-tumor tissue of the same 
tissue type. As such, the molecules listed below (and the polypeptides they encode) are excellent nucleic acid 
(and polypeptide) targets for the diagnosis and therapy of cancer in mammals. 



PERIPHERAL NERVOUS SYSTEM 
DNA324303 DNA324573 DNA324681 
DNA325408 DNA325409 DNA325410 
DNA326231 DNA188229 DNA327080 



DNA325296 DNA325405 DNA325407 
DNA325449 DNA325503 DNA326083 
DNA327081 DNA327082 



BRAIN 

DNA323721 

DNA323728 

DNA323740 

DNA323755 

DNA323781 

DNA323805 

DNA323817 

DNA323825 

DNA103214 

DNA323856 

DNA323882 

DNA323898 

DNA323912 

DNA323925 

DNA323937 

DNA294794 



DNA323722 
DNA323729 
DNA323742 
DNA323757 
DNA323783 
DNA323810 
DNA323821 
DNA323826 
DNA323834 
DNA323859 
DNA323887 
DNA323900 
DNA323918 
DNA323926 
DNA323938 
DNA323943 



DNA323723 
DNA323731 
DNA323743 
DNA323759 
DNA323785 
DNA323811 
DNA273060 
DNA323828 
DNA323837 
DNA323863 
DNA323888 
DNA323901 
DNA323921 
DNA257916 
DNA323939 
DNA323944 



DNA323724 
DNA323732 
DNA323744 
DNA323764 
DNA323795 
DNA323812 
DNA323823 
DNA323829 
DNA323838 
DNA323869 
DNA323892 
DNA323902 
DNA323922 
DNA323927 
DNA323940 
DNA323946 



DNA323726 
DNA287173 
DNA323751 
DNA323765 
DNA323796 
DNA323814 
DNA323824 
DNA323830 
DNA323839 
DNA323871 
DNA323893 
DNA323908 
DNA323923 
DNA323931 
DNA323942 
DNA323947 



DNA323727 

DNA151148 

DNA323753 

DNA323778 

DNA323797 

DNA83085 

DNA256503 

DNA323833 

DNA323846 

DNA323874 

DNA323897 

DNA210134 

DNA323924 

DNA323936 

DNA226793 

DNA323950 
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DNA323951 


DNA103436 


DNA323953 


DNA323958 


DNA323959 


DNA323961 




DNA226619 


DNA323962 


DNA323964 


DNA323969 


DNA323970 


DNA323973 




DNA323974 


DNA323975 


DNA323976 


DNA323977 


DNA323979 


DNA323980 




DNA323991 


DNA323992 


DNA323994 


DNA323995 


DNA324000 


DNA324001 




DNA324002 


DNA324003 


DNA227246 


DNA324004 


DNA324008 


DNA324009 


5 


DNA324010 


DNA324011 


DNA324012 


DNA196344 


DNA193882 


DNA324024 




DNA324034 


DNA324037 


DNA324042 


DNA324046 


DNA324047 


DNA324048 




DNA324050 


DNA324051 


DNA324055 


DNA275195 


DNA324059 


DNA324060 




DNA275049 


DNA324063 


DNA324065 


DNA324066 


DNA324067 


DNA324071 




DNA324072 


DNA324073 


DNA227165 


DNA324074 


DNA324076 


DNA324077 


10 


DNA324078 


DNA324079 


DNA324080 


DNA271243 


DNA324081 


DNA324082 




DNA324084 


DNA324088 


DNA324090 


DNA324091 


DNA324092 


DNA324099 




DNA324101 


DNA324106 


DNA324109 


DNA324111 


DNA324112 


DNA324121 




DNA324122 


DNA324123 


DNA324128 


DNA324129 


DNA227795 


DNA324130 




DNA324131 


DNA324132 


DNA324133 


DNA227528 


DNA324134 


DNA150725 


15 


DNA324136 


DNA324138 


DNA324139 


DNA324141 


DNA324146 


DNA324152 




DNA324153 


DNA324155 


DNA324159 


DNA324160 


DNA324161 


DNA324162 




DNA194740 


DNA324166 


DNA324175 


DNA324176 


DNA272127 


DNA324177 




DNA324182 


DNA324184 


DNA324186 


DNA324188 


DNA324194 


DNA324197 




DNA324198 


DNA324203 


DNA324204 


DNA324207 


DNA324209 


DNA324210 


20 


DNA324216 


DNA324218 


DNA324220 


DNA324221 


DNA324222 


DNA324223 




DNA324224 


DNA324227 


DNA324228 


DNA194827 


DNA324230 


DNA324231 




DNA324233 


DNA324234 


DNA324235 


DNA324237 


DNA324239 


DNA254204 




DNA324240 


DNA189697 


DNA324243 


DNA324246 


DNA324251 


DNA324253 




DNA150884 


DNA324256 


DNA324258 


DNA324260 


DNA324262 


DNA324264 


25 


DNA324269 


DNA324270 


DNA324271 


DNA324274 


DNA324275 


DNA269910 




DNA324279 


DNA324285 


DNA324286 


DNA324288 


DNA324290 


DNA270401 




DNA226547 


DNA324295 


DNA324296 


DNA324299 


DNA324300 


DNA324304 




DNA324305 


DNA324308 


DNA324309 


DNA324310 


DNA324313 


DNA324314 




DNA324315 


DNA324316 


DNA324317 


DNA103505 


DNA324318 


DNA324319 


30 


DNA324320 


DNA324323 


DNA324327 


DNA324328 


DNA324329 


DNA324330 




DNA324331 


DNA324333 


DNA324336 


DNA324338 


DNA324342 


DNA324343 




DNA324353 


DNA88547 


DNA324356 


DNA324358 


DNA324359 


DNA324361 




DNA324363 


DNA324364 


DNA324365 


DNA324366 


DNA324367 


DNA324368 




DNA324369 


DNA324371 


DNA324377 


DNA324387 


DNA324388 


DNA324389 


35 


DNA324390 


DNA324397 


DNA324398 


DNA324410 


DNA324411 


DNA324412 




DNA324413 


DNA254620 


DNA324415 


DNA324417 


DNA324418 


DNA89239 
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DNA324420 


DNA225592 


DNA324422 


DNA324428 


DNA324429 


DNA324434 




DNA324435 


DNA324437 


DNA324441 


DNA324442 


DNA324443 


DNA324448 




DNA324449 


DNA324457 


DNA324465 


DNA324466 


DNA324467 


DNA324472 




DNA257511 


DNA324483 


DNA324485 


DNA324486 


DNA225919 


DNA324487 




DNA324491 


DNA324495 


DNA324496 


DNA324497 


DNA324498 


DNA324510 


5 


DNA324512 


DNA324513 


DNA324516 


DNA324518 


DNA324519 


DNA324521 




DNA324524 


DNA324525 


DNA227575 


DNA324526 


DNA225920 


DNA324527 




DNA225921 


DNA324528 


DNA324531 


DNA324532 


DNA324533 


DNA324534 




DNA324538 


DNA324540 


DNA324541 


DNA324542 


DNA324545 


DNA324546 




DNA324548 


DNA324558 


DNA324559 


DNA324564 


DNA324577 


DNA324578 


10 


DNA288259 


DNA324590 


DNA324591 


DNA324595 


DNA324596 


DNA324597 




DNA324600 


DNA324604 


DNA324605 


DNA324613 


DNA324614 


DNA324615 




DNA324616 


DNA324618 


DNA324619 


DNA324620 


DNA324624 


DNA324625 




DNA83020 


DNA324626 


DNA103380 


DNA226872 


DNA324632 


DNA324640 




DNA324642 


DNA324643 


DNA324645 


DNA324646 


DNA324647 


DNA324649 


15 


DNA324651 


DNA324652 


DNA324653 


DNA150679 


DNA324654 


DNA324655 




DNA324656 


DNA324657 


DNA324658 


DNA324659 


DNA324660 


DNA324661 




DNA324662 


DNA324663 


DNA324664 


DNA324665 


DNA324666 


DNA324667 




DNA324668 


DNA324669 


DNA324670 


DNA324671 


DNA324672 


DNA324673 




DNA324674 


DNA324675 


DNA324676 


DNA324678 


DNA324681 


DNA324682 


20 


DNA324685 


DNA324686 


DNA324691 


DNA324694 


DNA324696 


DNA324697 




DNA324698 


DNA324700 


DNA324701 


DNA324702 


DNA324704 


DNA324705 




DNA225909 


DNA274206 


DNA324706 


DNA324707 


DNA324710 


DNA324711 




DNA324714 


DNA324715 


DNA324716 


DNA270675 


DNA324717 


DNA269593 




DNA324718 


DNA324719 


DNA324720 


DNA324721 


DNA272171 


DNA324728 


25 


DNA324729 


DNA304680 


DNA324730 


DNA324734 


DNA324736 


DNA324737 




DNA227204 


DNA324738 


DNA324740 


DNA287246 


DNA324743 


DNA324745 




DNA304716 


DNA324748 


DNA324749 


DNA324750 


DNA324751 


DNA324755 




DNA324756 


DNA324757 


DNA324758 


DNA227442 


DNA324766 


DNA324767 




DNA324768 


DNA324769 


DNA287227 


DNA324771 


DNA324772 


DNA324773 


30 


DNA324774 


DNA272263 


DNA287319 


DNA324777 


DNA324778 


DNA324779 




DNA324782 


DNA324784 


DNA324785 


DNA324786 


DNA324787 


DNA271040 




DNA324789 


DNA324791 


DNA324792 


DNA324794 


DNA324796 


DNA324797 




DNA324798 


DNA324799 


DNA324803 


DNA324804 


DNA324805 


DNA324809 




DNA324810 


DNA324812 


DNA324817 


PNA324819 


DNA324820 


DNA324821 


35 


DNA324826 


DNA324830 


DNA324836 


DNA324837 


DNA324838 


DNA324840 




DNA324841 


DNA324842 


DNA324844 


DNA324853 


DNA324866 


DNA324873 
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DNA324876 

DNA324886 

DNA225631 

DNA324903 

DNA324918 

DNA324929 

DNA324944 

DNA324960 

DNA324968 

DNA324978 

DNA324988 

DNA324999 

DNA325014 

DNA325027 

DNA325040 

DNA325047 

DNA325065 

DNA325072 

DNA325082 

DNA325103 

DNA325117 

DNA325136 

DNA325143 

DNA325150 

DNA325157 

DNA325166 

DNA325173 

DNA325182 

DNA325197 

DNA325204 

DNA289530 

DNA325219 

DNA325226 

DNA325240 

DNA325252 

DNA325264 

DNA325270 



DNA324877 

DNA324889 

DNA274326 

DNA324906 

DNA324920 

DNA273865 

DNA324945 

DNA304710 

DNA324969 

DNA324979 

DNA324989 

DNA325002 

DNA325015 

DNA325032 

DNA325041 

DNA325050 

DNA274178 

DNA325073 

DNA325083 

DNA325105 

DNA325118 

DNA325137 

DNA325144 

DNA325151 

DNA325160 

DNA325167 

DNA325174 

DNA325184 

DNA325199 

DNA257309 

DNA287271 

DNA325220 

DNA325229 

DNA325243 

DNA325253 

DNA325265 

DNA325271 



DNA324878 

DNA324890 

DNA324895 

DNA324907 

DNA324922 

DNA324931 

DNA324947 

DNA324962 

DNA324972 

DNA324980 

DNA324990 

DNA325005 

DNA325019 

DNA325033 

DNA325043 

DNA325052 

DNA325069 

DNA225671 

DNA325084 

DNA325106 

DNA325119 

DNA325138 

DNA325145 

DNA325152 

DNA325161 

DNA325168 

DNA325181 

DNA325187 

DNA325200 

DNA325206 

DNA325214 

DNA325221 

DNA88350 

DNA325246 

DNA325257 

DNA325266 

DNA325273 



DNA324879 

DNA324891 

DNA324896 

DNA324908 

DNA275334 

DNA324932 

DNA324952 

DNA324963 

DNA324973 

DNA324982 

DNA324996 

DNA325006 

DNA325020 

DNA325034 

DNA325044 

DNA325054 

DNA83022 

DNA325075 

DNA325085 

DNA325111 

DNA325126 

DNA325139 

DNA325146 

DNA325153 

DNA325163 

DNA325170 

DNA227491 

DNA325190 

DNA272213 

DNA325209 

DNA325216 

DNA325222 

DNA325235 

DNA325247 

DNA325258 

DNA325267 

DNA325274 



DNA324884 

DNA324892 

DNA324899 

DNA324916 

DNA324924 

DNA304707 

DNA324953 

DNA324965 

DNA324974 

DNA324984 

DNA324997 

DNA325012 

DNA325024 

DNA325035 

DNA325045 

DNA325062 

DNA325070 

DNA325076 

DNA325088 

DNA325112 

DNA325128 

DNA325140 

DNA325147 

DNA325155 

DNA325164 

DNA325171 

DNA254771 

DNA272655 

DNA325202 

DNA325211 

DNA325217 

DNA218841 

DNA325236 

DNA325249 

DNA325261 

DNA325268 

DNA325275 



DNA324885 

DNA324894 

DNA324902 

DNA324917 

DNA324925 

DNA324938 

DNA324955 

DNA324966 

DNA324977 

DNA272090 

DNA324998 

DNA325013 

DNA325026 

DNA325037 

DNA325046 

DNA325064 

DNA325071 

DNA227267 

DNA325102 

DNA325116 

DNA325132 

DNA325141 

DNA325148 

DNA325156 

DNA325165 

DNA226345 

DNA89242 

DNA275322 

DNA325203 

DNA325212 

DNA325218 

DNA325223 

DNA325237 

DNA325250 

DNA325262 

DNA325269 

DNA325276 
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DNA325278 


DNA325279 


DNA325283 


DNA325288 


DNA325290 


DNA325292 




DNA325293 


DNA325296 


DNA325301 


DNA325302 


DNA325303 


DNA325304 




DNA325307 


DNA325309 


DNA325310 


DNA325312 


DNA325314 


DNA325315 




DNA325316 


DNA325318 


DNA325319 


DNA325320 


DNA325322 


DNA325324 




DNA 193957 


DNA325325 


DNA325326 


DNA325328 


DNA325329 


DNA325331 


5 


DNA325333 


DNA325334 


DNA325335 


DNA325336 


DNA325337 


DNA325338 




DNA325341 


DNA304459 


DNA325342 


DNA325343 


DNA325344 


DNA325346 




DNA325347 


DNA325348 


DNA325349 


DNA325355 


DNA325360 


DNA325361 




DNA325362 


DNA325363 


DNA325364 


DNA325365 


DNA325369 


DNA325372 




DNA325375 


DNA325381 


DNA325384 


DNA325385 


DNA325393 


DNA325395 


10 


DNA269952 


DNA325396 


DNA325397 


DNA325400 


DNA325402 


DNA325403 




DNA325404 


DNA325405 


DNA325407 


DNA325408 


DNA325409 


DNA325410 




DNA325413 


DNA325414 


DNA325415 


DNA325417 


DNA325418 


DNA325423 




DNA325425 


DNA325426 


DNA325430 


DNA325434 


DNA97285 


DNA325446 




DNA325451 


DNA325452 


DNA325453 


DNA325456 


DNA325457 


DNA150974 


15 


DNA325458 


DNA287417 


DNA227088 


DNA325462 


DNA325464 


DNA325465 




DNA325466 


DNA325469 


DNA287254 


DNA325471 


DNA325474 


DNA325476 




DNA325477 


DNA325479 


DNA325480 


DNA325481 


DNA325482 


DNA325483 




DNA325484 


DNA325489 


DNA325491 


DNA325492 


DNA325493 


DNA325495 




DNA325496 


DNA325497 


DNA325498 


DNA269803 


DNA325500 


DNA325501 


20 


DNA325503 


DNA325505 


DNA270721 


DNA189687 


DNA325506 


DNA325511 




DNA325512 


DNA325513 


DNA103474 


DNA325514 


DNA325516 


DNA325517 




DNA325518 


DNA325519 


DNA325520 


DNA325521 


DNA325522 


DNA325523 




DNA88176 


DNA325529 


DNA325530 


DNA325534 


DNA325535 


DNA325539 




DNA325540 


DNA325541 


DNA325544 


DNA325545 


DNA325546 


DNA325547 


25 


DNA325549 


DNA225752 


DNA325551 


DNA325553 


DNA325554 


DNA325557 




DNA325561 


DNA325563 


DNA325566 


DNA325568 


DNA325571 


DNA325572 




DNA325573 


DNA325574 


DNA325575 


DNA325579 


DNA325580 


DNA325583 




DNA325585 


DNA325586 


DNA325587 


DNA88114 


DNA325592 


DNA325593 




DNA325596 


DNA325597 


DNA325600 


DNA325601 


DNA225632 


DNA83180 


30 


DNA325603 


DNA325608 


DNA325618 


DNA150997 


DNA325625 


DNA325631 




DNA325636 


DNA325638 


DNA325639 


DNA325642 


DNA325643 


DNA325649 




DNA325650 


DNA325651 


DNA325652 


DNA325653 


DNA325654 


DNA325655 




DNA325656 


DNA325657 


DNA325658 


DNA325659 


DNA325660 


DNA325661 




DNA325664 


DNA270458 


DNA227092 


DNA325665 


DNA325669 


DNA325670 


35 


DNA325673 


DNA325674 


DNA325675 


DNA325676 


DNA325677 


DNA325679 




DNA325680 


DNA325681 


DNA325683 


DNA325684 


DNA325687 


DNA325688 
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DNA325689 


DNA325690 


DNA325691 


DNA325695 


DNA325698 


DNA325702 




DNA325706 


DNA79101 


DNA325709 


DNA325711 


DNA325712 


DNA325717 




DNA325720 


DNA325721 


DNA325723 


DNA325724 


DNA325731 


DNA226014 




DNA325733 


DNA325736 


DNA325739 


DNA325747 


DNA325750 


DNA325752 




DNA325755 


DNA325758 


DNA325761 


DNA325762 


DNA325763 


DNA325766 


5 


DNA325768 


DNA325773 


DNA325775 


DNA325776 


DNA325782 


DNA325786 




DNA325787 


DNA302016 


DNA325789 


DNA325793 


DNA325794 


DNA325796 




DNA325797 


DNA325802 


DNA325806 


DNA325807 


DNA325808 


DNA325809 




DNA226853 


DNA325811 


DNA325812 


DNA325814 


DNA325818 


DNA325819 




DNA270254 


DNA281436 


DNA325837 


DNA325838 


DNA325840 


DNA325843 


10 


DNA325844 


DNA325850 


DNA325851 


DNA325852 


DNA325855 


DNA325856 




DNA325858 


DNA325859 


DNA325870 


DNA325875 


DNA325878 


DNA325885 




DNA325895 


DNA3259Q2 


DNA225649 


DNA325913 


DNA325915 


DNA325918 




DNA325919 


DNA325922 


DNA325924 


DNA325928 


DNA325932 


DNA325935 




DNA325938 


DNA325942 


DNA325943 


DNA325946 


DNA325947 


DNA325949 


15 


DNA325950 


DNA325951 


DNA325956 


DNA325960 


DNA325974 


DNA325975 




DNA325976 


DNA325977 


DNA325980 


DNA325981 


DNA325985 


DNA325986 




DNA325991 


DNA325992 


DNA325994 


DNA325995 


DNA325996 


DNA326002 




DNA326003 


DNA326005 


DNA326006 


DNA326007 


DNA326010 


DNA326011 




DNA226646 


DNA326022 


DNA287331 


DNA326024 


DNA326025 


DNA326026 


20 


DNA326028 


DNA326029 


DNA326030 


DNA326032 


DNA326034 


DNA326038 




DNA326039 


DNA326040 


DNA326041 


DNA326042 


DNA326046 


DNA326047 




DNA326049 


DNA326052 


DNA326053 


DNA326057 


DNA326061 


DNA326062 




DNA326064 


DNA326066 


DNA326068 


DNA275181 


DNA326069 


DNA326071 




DNA326075 


DNA326076 


DNA326078 


DNA326079 


DNA326080 


DNA326085 


25 


DNA326086 


DNA326087 


DNA326091 


DNA273839 


DNA256844 


DNA326092 




DNA326093 


DNA256886 


DNA326095 


DNA254781 


DNA326096 


DNA326097 




DNA326098 


DNA326099 


DNA326100 


DNA326102 


DNA326103 


DNA326109 




DNA326110 


DNA326111 


DNA326112 


DNA326113 


DNA326114 


DNA326115 




DNA326116 


DNA326117 
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DNA326687 


DNA326688 


DNA326691 


DNA326692 




DNA326698 


DNA290260 


DNA304658 


DNA326762 


DNA326769 


DNA326790 




DNA326791 


DNA326792 


DNA326796 


DNA326798 


DNA326837 


DNA326854 




DNA326858 


DNA326884 


DNA326885 


DNA326886 


DNA326940 


DNA326941 




DNA269830 


DNA254240 


DNA326974 


DNA327005 


DNA327019 


DNA327020 


1 c 
ID 






TYM A 

XJrSJ\DZ /uzo 




din Ajz /uzy 






DNA327044 


DNA327060 


DNA327062 


DNA273254 


DNA327066 


DNA327067 




DNA327072 


DNA327077 


DNA327078 


DNA327079 


DNA327083 


DNA327084 




DNA327098 


DNA327100 


DNA327114 








20 


CERVIX 














DNA324417 


DNA324418 
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DNA326931 DNA326932 DNA326935 
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DNA327099 DNA327114 DNA103558 
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DNA324340 
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DNA324642 
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DNA324021 


DNA324033 


DNA324040 


DNA324041 


DNA324052 


DNA324240 
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DNA325535 


DNA325550 


DNA325569 


DNA325570 


DNA325584 


DNA325593 




DNA325595 


DNA151827 


DNA325601 


DNA225632 


DNA103514 


DNA325604 


15 


DNA325618 


DNA325625 


DNA325633 


DNA325634 


DNA271344 


DNA325642 




DNA325644 


DNA325645 


DNA325658 


DNA325659 


DNA325660 


DNA325662 




DNA270458 


DNA227092 


DNA325674 


DNA325680 


DNA325686 


DNA325695 




DNA325704 


DNA325711 


DNA325712 


DNA325720 


DNA325731 


DNA325750 




DNA325752 


DNA325755 


DNA325757 


DNA325758 


DNA325773 


DNA325775 


20 


DNA325776 


DNA325786 


DNA302016 


DNA325789 


DNA325806 


DNA325809 




DNA325810 


DNA325811 


DNA325812 


DNA325814 


DNA325818 


DNA325822 




DNA325837 


DNA325838 


DNA325843 


DNA325844 


DNA325864 


DNA325891 




DNA325894 


DNA325913 


DNA325920 


DNA269498 


DNA325923 


DNA325933 




DNA325935 


DNA325945 


DNA103509 


DNA325952 


DNA325953 


DNA325957 


25 


DNA325958 


DNA325965 


DNA325985 


DNA325988 


DNA325994 


DNA326002 




DNA226646 


DNA326022 


DNA287331 


DNA326041 


DNA326046 


DNA326047 




DNA326099 


DNA326102 


DNA326116 


DNA326121 


DNA326122 


DNA326124 




DNA326128 


DNA326129 


DNA326133 


DNA289522 


DNA326136 


DNA326146 




DNA326155 


DNA326156 


DNA326168 


DNA326169 


DNA287355 


DNA326177 


30 


DNA326186 


DNA326194 


DNA326214 


DNA326230 


DNA326233 


DNA326234 




DNA326256 


DNA326260 


DNA97300 


DNA326273 


DNA326278 


DNA326279 




DNA326287 


DNA326288 


DNA326289 


DNA326291 


DNA326292 


DNA326296 




DNA326297 


DNA326300 


DNA326309 


DNA326311 


DNA326330 


DNA272889 




DNA270975 


DNA326347 


DNA270901 


DNA326381 


DNA326384 


DNA326396 


35 


DNA326404 


DNA129504 


DNA326414 


DNA326415 


DNA326416 


DNA326426 




DNA326427 


DNA326429 


DNA326430 


DNA326432 


DNA326433 


DNA326440 
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DNA326441 DNA326442 

DNA326452 DNA326453 

DNA326463 DNA326479 

DNA326487 DNA326499 

DNA326559 DNA326562 

DNA326583 DNA326584 

DNA326603 DNA326615 

DNA326642 DNA326651 

DNA326676 DNA326683 

DNA326690 DNA326691 

DNA326726 DNA326727 

DNA326741 DNA326742 

DNA254548 DNA326769 

DNA326787 DNA326789 

DNA3268 19 DNA2735 17 

DNA326885 DNA326886 

DNA326937 DNA269830 



DNA326981 


DNA326983 


DNA327033 


DNA327054 


DNA327078 


DNA327079 


BREAST 




DNA323717 


DNA273712 


DNA323805 


DNA323817 


DNA323858 


DNA323859 


DNA323869 


DNA323870 


DNA323936 


DNA323943 


DNA323980 


DNA323990 


DNA324042 


DNA324047 


DNA324091 


DNA324092 


DNA324112 


DNA227795 


DNA324159 


DNA324170 


DNA324207 


DNA324210 


DNA324243 


DNA324276 


DNA324320 


DNA324338 


DNA324373 


DNA324390 


DNA324418 


DNA324423 



DNA326446 DNA326449 

DNA326454 DNA271841 

DNA326481 DNA326482 

DNA326512 DNA287636 

DNA326573 DNA326579 

DNA326585 DNA274034 

DNA326625 DNA326626 

DNA326657 DNA326660 

DNA326684 DNA326685 

DNA326692 DNA326698 

DNA326731 DNA290260 

DNA326756 DNA326758 

DNA326773 DNA287270 

DNA326798 DNA326801 

DNA194701 DNA103525 

DNA254572 DNA326901 

DNA326952 DNA326953 

DNA327005 DNA327023 

DNA327060 DNA327067 

DNA327085 DNA327111 



DNA226262 DNA323778 

DNA323820 DNA323829 

DNA323862 DNA323863 

DNA323871 DNA323872 

DNA323944 DNA323947 

DNA323998 DNA324004 

DNA324054 DNA324063 

DNA324101 DNA324103 

DNA324134 DNA227190 

DNA324178 DNA324189 

DNA324218 DNA324224 

DNA324285 DNA226547 

DNA324340 DNA324341 

DNA324391 DNA324394 

DNA324434 DNA324437 



DNA326450 DNA32645 1 

DNA326457 DNA326459 

DNA326484 DNA326485 

DNA326516 DNA326523 

DNA326581 DNA326582 

DNA326596 DNA326597 

DNA326633 DNA326634 

DNA326661 DNA274139 

DNA326687 DNA326688 

DNA326702 DNA103580 

DNA326736 DNA326739 

DNA326761 DNA273346 

DNA326781 DNA326782 

DNA326808 DNA326818 

DNA326844 DNA326884 

DNA326902 DNA326921 

DNA326972 DNA326974 

DNA327025 DNA327029 

DNA327068 DNA327077 
DNA227013 



DNA323784 DNA323804 

DNA323836 DNA323845 

DNA323867 DNA323868 

DNA323919 DNA323922 

DNA323953 DNA323964 

DNA324009 DNA324013 

DNA324075 DNA324090 

DNA3241 10 DNA3241 1 1 

DNA324149 DNA324154 

DNA324192 DNA324193 

DNA324230 DNA324236 

DNA324295 DNA150976 

DNA324346 DNA324347 

DNA324412 DNA324417 

DNA324438 DNA139747 
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DNA253804 


DNA324471 


DNA324472 


DNA324478 


DNA324479 


DNA324483 


DNA324489 


DNA324495 


DNA324502 


DNA324503 


DNA324506 


DNA324509 


DNA324511 


DNA324512 


DNA225584 


DNA324517 


DNA324521 


DNA324525 


DNA324549 


DNA324550 


DNA324551 


DNA324554 


DNA324561 


DNA324564 


DNA324565 


DNA324568 


DNA324574 


DNA324576 


DNA324577 


DNA324579 


DNA324591 


DNA324592 


DNA324595 


DNA324596 


DNA324597 


DNA324599 


DNA324600 


DNA324601 


DNA324605 


DNA324613 


DNA324624 


DNA103380 


DNA324632 


DNA324633 


DNA324641 


DNA324643 


DNA324645 


DNA324679 


DNA324682 


DNA324684 


DNA324685 


DNA324690 


DNA324712 


DNA324714 


DNA324717 


DNA324720 


DNA324727 


DNA304680 


DNA324736 


DNA324737 


DNA324746 


DNA324749 


DNA324751 


DNA324755 


DNA304661 


DNA287227 


DNA324773 


DNA324785 


DNA324790 


DNA324796 


DNA324797 


DNA324807 


DNA324810 


DNA324811 


DNA324824 


DNA324827 


DNA324841 


DNA324844 


DNA324858 


DNA324866 


DNA324874 


DNA324878 


DNA324879 


DNA225631 


DNA324902 


DNA324905 


DNA324910 


DNA324928 


DNA324945 


DNA304710 


DNA324963 


DNA324966 


DNA324967 


DNA324968 


DNA304801 


DNA272090 


DNA324987 


DNA324989 


DNA325000 


DNA325006 


DNA325010 


DNA325015 


DNA325024 


DNA325026 


DNA325027 


DNA325034 


DNA325078 


DNA325079 


DNA325080 


DNA325081 


DNA325099 


DNA325101 


DNA325103 


DNA325104 


DNA325106 


DNA325111 


DNA325113 


DNA325116 


DNA325117 


DNA325118 


DNA325119 


DNA325120 


DNA325121 


DNA325123 


DNA325127 


DNA325141 


DNA325152 


DNA325153 


DNA325155 


DNA325156 


DNA325157 


DNA325162 


DNA325164 


DNA325179 


DNA325180 


DNA325182 


DNA325183 


DNA325184 


DNA325190 


DNA325200 


DNA325202 


DNAi25206 


DNA325209 


DNA325222 


DNA325229 


DNA325231 


DNA325232 


DNA325234 


DNA325250 


DNA325278 


DNA325291 


DNA325292 


DNA325295 


DNA325301 


DNA325326 


DNA325339 


DNA325340 


DNA325343 


DNA325344 


DNA325346 


DNA325347 


DNA325356 


DNA325358 


DNA325374 


DNA325381 


DNA325386 


DNA325389 


DNA325391 


DNA325395 


DNA325428 


DNA325430 


DNA325431 


DNA325436 


DNA325437 


DNA97285 


DNA325442 


DNA325451 


DNA325452 


DNA75863 


DNA325475 


DNA325483 


DNA325523 


DNA325525 


DNA325528 


DNA325535 


DNA325549 


DNA325576 


DNA325584 


DNA325596 


DNA325601 


DNA225632 


DNA325618 


DNA325625 


DNA325633 


DNA325642 


DNA325644 


DNA325645 


DNA325662 


DNA270458 


DNA227092 


DNA325674 


DNA325680 


DNA325696 


DNA325697 


DNA325711 


DNA325712 


DNA325731 


DNA325736 


DNA325757 


DNA325762 


DNA325765 


DNA325783 


DNA325786 


DNA302016 


DNA325789 


DNA325804 


DNA325806 


DNA325809 


DNA325810 


DNA3258U 


DNA325812 


DNA325814 
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DNA325837 

DNA325900 

DNA325930 

DNA325986 

DNA325998 

DNA326047 

DNA97293 

DNA326156 

DNA326254 

DNA326281 

DNA326291 

DNA326364 

DNA326450 

DNA326463 

DNA270315 

DNA326615 

DNA326651 

DNA326688 

DNA83154 

DNA287270 

DNA194701 

DNA326864 

DNA326921 

DNA327016 

DNA327062 

DNA327087 

STOMACH 
DNA287173 
DNA323873 
DNA324028 
DNA227795 
DNA324244 
DNA324418 
DNA324556 
DNA324769 
DNA324907 



DNA325838 
DNA325906 
DNA325933 
DNA227206 
DNA326000 
DNA326075 
DNA326122 
DNA287355 
DNA326260 
DNA304715 
DNA326292 
DNA326378 
DNA326451 
DNA326469 
DNA326546 
DNA326620 
DNA326657 
DNA326698 
DNA326756 
DNA326792 
DNA103525 
DNA326866 
DNA326952 
DNA327023 
DNA273254 
DNA327090 



DNA323805 
DNA323884 
DNA324029 
DNA324155 
DNA324294 
DNA324471 
DNA324558 
DNA324790 
DNA324908 



DNA325839 

DNA325907 

DNA325935 

DNA325990 

DNA326002 

DNA326079 

DNA326124 

DNA326187 

DNA97300 

DNA326282 

DNA66475 

DNA326381 

DNA326452 

DNA326499 

DNA326557 

DNA227249 

DNA272347 

DNA326732 

DNA326758 

DNA326796 

DNA326841 

DNA326870 

DNA326969 

DNA327025 

DNA327067 

DNA327092 



DNA323849 
DNA323920 
DNA324039 
DNA324179 
DNA324362 
DNA324504 
DNA324624 
DNA324808 
DNA324922 



DNA325843 
DNA325908 
DNA325966 
DNA325991 
DNA326022 
DNA326099 
DNA326128 
DNA326233 
DNA326273 
DNA326286 
DNA326324 
DNA326396 
DNA326453 
DNA287636 
DNA326559 
DNA326633 
DNA326669 
DNA290260 
DNA326759 
DNA326798 
DNA326862 
DNA326885 
DNA326971 
DNA327029 
DNA327068 
DNA276159 



DNA323864 
DNA323925 
DNA324048 
DNA324180 
DNA324364 
DNA324541 
DNA324630 
DNA324850 
DNA304710 



DNA325844 
DNA325913 
DNA227559 
DNA219233 
DNA326041 
DNA326113 
DNA326129 
DNA326234 
DNA326278 
DNA290292 
DNA326326 
DNA326415 
DNA326454 
DNA326529 
DNA326562 
DNA326634 
DNA326686 
DNA326741 
DNA326769 
DNA326799 
DNA326863 
DNA326886 
DNA326974 
DNA273992 
DNA327073 
DNA327127 



DNA323865 
DNA323934 
DNA324065 
DNA324216 
DNA324398 
DNA324552 
DNA304680 
DNA225631 
DNA324962 



DNA325848 
DNA325922 
DNA325985 
DNA325994 
DNA326046 
DNA326115 
DNA326136 
DNA326251 
DNA326280 
DNA326289 
DNA326327 
DNA326449 
DNA326457 
DNA326541 
DNA326579 
DNA326635 
DNA326687 
DNA326742 
DNA326777 
DNA326816 
DNA304670 
DNA326903 
DNA326981 
DNA327060 
DNA327085 



DNA323866 
DNA323990 
DNA227545 
DNA324243 
DNA324417 
DNA324555 
DNA324756 
DNA324906 
DNA324963 
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DNA324972 

DNA325078 

DNA325149 

DNA325192 

DNA325251 

DNA325320 

DNA325444 

DNA325535 

DNA325645 

DNA325803 

DNA325986 

DNA326196 

DNA326427 

DNA326750 

DNA304670 

DNA326983 

DNA327116 



DNA324973 
DNA325079 
DNA325156 
DNA325202 
DNA325262 
DNA325368 
DNA325446 
DNA325570 
DNA270458 
DNA325804 
DNA325993 
DNA326284 
DNA326517 
DNA326791 
DNA326864 
DNA327040 
DNA327127 



DNA324982 
DNA325104 
DNA325157 
DNA325224 
DNA325268 
DNA325418 
DNA325474 
DNA325601 
DNA227092 
DNA274058 
DNA326019 
DNA326311 
DNA326603 
DNA326846 
DNA326865 
DNA327042 



DNA324997 

DNA325105 

DNA89242 

DNA325233 

DNA325306 

DNA97285 

DNA325480 

DNA225632 

DNA325773 

DNA325843 

DNA287331 

DNA326333 

DNA326641 

DNA326859 

DNA326918 

DNA327055 



DNA325033 
DNA325106 
DNA325186 
DNA325235 
DNA325316 
DNA325441 
DNA325506 
DNA325642 
DNA325775 
DNA325873 
DNA326043 
DNA326347 
DNA326642 
DNA326862 
DNA326961 
DNA273254 



DNA325074 
DNA325148 
DNA325191 
DNA325236 
DNA325318 
DNA325442 
DNA325534 
DNA325644 
DNA325776 
DNA325941 
DNA326133 
DNA326397 
DNA326698 
DNA326863 
DNA326977 
DNA327099 



BONE 

DNA323765 

DNA323869 

DNA324009 

DNA324154 

DNA324293 

DNA324417 

DNA324488 

DNA324512 

DNA324551 

DNA324575 

DNA324613 

DNA324687 

DNA304661 

DNA324829 

DNA324906 

DNA325027 

DNA325157 

DNA325202 



DNA323817 

DNA323871 

DNA324090 

DNA324155 

DNA226547 

DNA324418 

DNA324501 

DNA324521 

DNA324554 

DNA324576 

DNA324624 

DNA324697 

DNA324785 

DNA324844 

DNA324926 

DNA325034 

DNA325164 

DNA325206 



DNA323820 

DNA323914 

DNA324091 

DNA324200 

DNA324295 

DNA324423 

DNA324502 

DNA324525 

DNA324555 

DNA324579 

DNA324632 

DNA324717 

DNA324796 

DNA324866 

DNA324989 

DNA325111 

DNA325179 

DNA325222 



DNA323829 

DNA323947 

DNA324092 

DNA324201 

DNA324326 

DNA324437 

DNA324503 

DNA324541 

DNA324556 

DNA324595 

DNA324641 

DNA324720 

DNA324797 

DNA324902 

DNA325015 

DNA325116 

DNA325182 

DNA325229 



DNA323864 

DNA323964 

DNA324111 

DNA324210 

DNA324347 

DNA324472 

DNA324504 

DNA324549 

DNA324557 

DNA324596 

DNA324645 

DNA324737 

DNA150772 

DNA324904 

DNA325024 

DNA131588 

DNA325183 

DNA325231 



DNA323867 

DNA324004 

DNA324112 

DNA324230 

DNA324390 

DNA324483 

DNA324505 

DNA324550 

DNA324558 

DNA324604 

DNA324682 

DNA324756 

DNA324828 

DNA324905 

DNA325026 

DNA325156 

DNA325184 

DNA325232 
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DNA325234 


DNA325236 


DNA325250 


DNA325301 


DNA325303 


DNA325326 


DNA325339 


DNA325340 


DNA325347 


DNA325358 


DNA325395 


DNA325430 


DNA325437 


DNA325451 


DNA325452 


DNA325523 


DNA325558 


DNA325570 


DNA325576 


DNA325601 


DNA225632 


DNA325633 


DNA325731 


DNA325733 


DNA325736 


DNA325762 


DNA325786 


DNA302016 


DNA325789 


DNA325806 


DNA325810 


DNA325811 


DNA325812 


DNA325843 


DNA325844 


DNA325906 


DNA325908 


DNA325913 


DNA325922 


DNA325935 


DNA325985 


DNA326002 


DNA326041 


DNA326046 


DNA326099 


DNA326233 


DNA326234 


DNA326251 


DNA97300 


DNA304715 


DNA326286 


DNA326289 


DNA326381 


DNA326457 


DNA326580 


DNA326633 


DNA326634 


DNA326635 


DNA326651 


DNA290260 


DNA326796 


DNA326884 


DNA326886 


DNA326974 


DNA326977 


DNA327005 


DNA327025 


DNA327060 


DNA327062 


DNA327067 


DNA327114 





EXAMPLE 2: Use of TAT as a hybridization probe 

The following method describes use of a nucleotide sequence encoding TAT as a hybridization probe 
for, i.e., diagnosis of the presence of a tumor in a mammal. 

DNA comprising the coding sequence of full-length or mature TAT as disclosed herein can also be 
employed as a probe to screen for homologous DNAs (such as those encoding naturally-occurring variants of 
TAT) in human tissue cDNA libraries or human tissue genomic libraries. 
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Hybridization and washing of filters containing either library DNAs is performed under the following 
high stringency conditions. Hybridization of radiolabeled TAT-derived probe to the filters is performed in a 
solution of 50% formamide, 5x SSC, 0.1% SDS, 0.1% sodium pyrophosphate, 50 mM sodium phosphate, pH 
6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of the filters is performed 
in an aqueous solution of 0. lx SSC and 0.1% SDS at 42°C. 
5 DNAs having a desired sequence identity with the DNA encoding full-length native sequence TAT can 

then be identified using standard techniques known in the art. 

EXAMPLE 3: Expression of TAT in E. coli 

This example illustrates preparation of an unglycosylated form of TAT by recombinant expression in 

10 E. colL 

The DNA sequence encoding TAT is initially amplified using selected PCR primers. The primers 
should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected 
expression vector. A variety of expression vectors may be employed. An example of a suitable vector is 
pBR322 (derived from E. coli; see Bolivar et al., Gene. 2:95 (1977)) which contains genes for ampicillin and 
15 tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR 

amplified sequences are then ligated into the vector. The vector will preferably include sequences which encode 
for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, polyhis 
sequence, and enterokinase cleavage site), the TAT coding region, lambda transcriptional terminator, and an 
argU gene. 

20 The ligation mixture is then used to transform a selected E. coli strain using the methods described in 

Sambrook et al., supra. Transformants are identified by their ability to grow on LB plates and antibiotic 
resistant colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and. 
DNA sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with 
25 antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are 
then grown to a desired optical density, during which the expression promoter is turned on. 

After culturing the cells for several more hours, the cells can be harvested by centrifiigation. The cell 
pellet obtained by the centrifiigation can be solubilized using various agents known in the art, and the solubilized 
TAT protein can then be purified using a metal chelating column under conditions that allow tight binding of 
30 the protein. 

TAT may be expressed in E. coli in a poly-His tagged form, using the following procedure. The DNA 
encoding TAT is initially amplified using selected PCR primers. The primers will contain restriction enzyme 
sites which correspond to the restriction enzyme sites on the selected expression vector, and other useful 
sequences providing for efficient and reliable translation initiation, rapid purification on a metal chelation 
35 column, and proteolytic removal with enterokinase. The PCR-amplified, poly-His tagged sequences are then 
ligated into an expression vector, which is used to transform an E. coli host based on strain 52 (W3110 
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fuhA(tonA) Ion galE rpoHts(htpRts) clpP(ladq). Transformants are first grown in LB containing 50 mg/ml 
carbenicillin at 30°C with shaking until an O.D.600 of 3-5 is reached. Cultures are then diluted 50-100 fold 
into CRAP media (prepared by mixing 3.57 g (NH^SC^, 0.71 g sodium citrate«2H20, 1.07 g KC1, 5.36 g 
Difco yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 110 mM MPOS, pH 7.3, 0.55% 
(w/v) glucose and 7 mM MgSO^ and grown for approximately 20-30 hours at 30°C with shaking. Samples are 
5 removed to verify expression by SDS-PAGE analysis, and the bulk culture is centrifuged to peUet the cells. 
Cell pellets are frozen until purification and refolding. 

E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 
M guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final 
concentrations of 0. 1M and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This step results 

10 in a denatured protein with all cysteine residues blocked by sulfitolization. The solution is centrifuged at 40,000 
rpm in a Beckman Ultracentifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate 
column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The 
clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate 
column buffer. The column is washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol 

1 5 grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions containing the 
desired protein are pooled and stored at 4°C. Protein concentration is estimated by its absorbance at 280 nm 
using the calculated extinction coefficient based on its amino acid sequence. 

The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting 
of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA. 

20 Refolding volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The 
refolding solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is quenched by the addition 
of TFA to a final concentration of 0.4% (pH of approximately 3). Before further purification of the protein, 
the solution is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The 
refolded protein is chromatographed on a Poros Rl/H reversed phase column using a mobile buffer of 0.1% 

25 TFA with elution with a gradient of acetonitrile from 10 to 80% . Aliquots of fractions with A280 absorbance 
are analyzed on SDS polyacrylamide gels and fractions containing homogeneous refolded protein are pooled. 
Generally, the properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile 
since those species are the most compact with their hydrophobic interiors shielded from interaction with the 
reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition 

30 to resolving misfolded forms of proteins from the desired form, the reversed phase step also removes endotoxin 
from the samples. 

Fractions containing the desired folded TAT polypeptide are pooled and the acetonitrile removed using 
a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with 
0. 14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins 
35 equilibrated in the formulation buffer and sterile filtered. 

Certain of the TAT polypeptides disclosed herein have been successfully expressed and purified using 
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this technique(s). 

EXAMPLE 4: Expression of TAT in mammalian cells 

This example illustrates preparation of a potentially glycosylated form of TAT by recombinant 
expression in mammalian cells. 
5 The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector. 

Optionally, the TAT DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the TAT 
DNA using ligation methods such as described in Sambrook et al. , supra. The resulting vector is called pRK5- 
TAT. 

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are 

1 0 grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and 
optionally, nutrient components and/or antibiotics. About 10 \ig pRK5-TAT DNA is mixed with about 1 jig 
DNA encoding the VA RNA gene [Thimmappaya et al., CeU, 31:543 (1982)] and dissolved in 500 \xl of 1 mM 
Tris-HCl, 0. 1 mM EDTA, 0.227 M CaCl 2 . To this mixture is added, dropwise, 50Qxl of 50 mM HEPES (pH 
7.35), 280 mM NaCl, 1.5 mM NaPO 4 , and a precipitate is allowed to form for 10 minutes at 25°C. The 

15 precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37°C. The 
culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are 
then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days. 

Approximately 24 hours after the transfections, the culture medium is removed and replaced with 
culture medium (alone) or culture medium containing 200 nCi/ml 35 S-cysteine and 200 fiCi/ml 35 S-methionine. 

20 After a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto 
a 15 % SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the 
presence of TAT polypeptide. The cultures containing transfected cells may undergo further incubation (in 
serum free medium) and the medium is tested in selected bioassays. 

In an alternative technique, TAT may be introduced into 293 cells transiently using the dextran sulfate 

25 method described by Somparyrac et al., Proc. Natl. Acad. Sci. . 12:7575 (1981). 293 cells are grown to 
maximal density in a spinner flask and 700 \xg pRK5-TAT DNA is added. The cells are first concentrated from 
the spinner flask by centrifiigation and washed with PBS. The DNA-dextran precipitate is incubated on the cell 
pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture 
medium, and re-introduced into the spinner flask containing tissue culture medium, 5 ng/ml bovine insulin and 

30 0. 1 ng/ml bovine transferrin. After about four days, die conditioned media is centrifiaged and filtered to remove 
cells and debris. The sample containing expressed TAT can then be concentrated and purified by any selected 
method, such as dialysis and/or column chromatography. 

In another embodiment, TAT can be expressed in CHO cells. The pRK5-TAT can be transfected into 
CHO cells using known reagents such as CaP0 4 or DEAE-dextran. As described above, the cell cultures can 

35 be incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such 
as 35 S-methionine. After determining the presence of TAT polypeptide, the culture medium may be replaced 

367 



WO 2004/030615 



PCT7US2003/028547 



with serum free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned 
medium is harvested. The medium containing the expressed TAT can then be concentrated and purified by any 
selected method. 

Epitope-tagged TAT may also be expressed in host CHO cells. The TAT may be subcloned out of die 
pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as a poly- 
5 his tag into a Baculovirus expression vector. The poly-his tagged TAT insert can then be subcloned into a SV40 
driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO cells 
can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as described 
above, to verify expression. The culture medium containing the expressed poly-His tagged TAT can then be 
concentrated and purified by any selected method, such as by Ni 2+ -chelate affinity chromatography. 
1 0 TAT may also be expressed in CHO and/or COS cells by a transient expression procedure or in CHO 

cells by another stable expression procedure. 

Stable expression in CHO cells is performed using the following procedure. The proteins are expressed 
as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g. extracellular 
domains) of the respective proteins are fused to an IgGl constant region sequence containing the hinge, CH2 
1 5 and CH2 domains and/or is a poly-His tagged form. 

Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using 
standard techniques as described in Ausubel et al., Current Protocols of Molecular Biology. Unit 3.16, John 
Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3* 
of the DNA of interest to allow the convenient shuttling of cDNA's. The vector used expression in CHO cells 
20 is as described in Lucas etal., Nucl. Acids Res. 24:9 (1774-1779 (1996), and uses the SV40 early 
promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR 
expression permits selection for stable maintenance of the plasmid following transfection. 

Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 million CHO cells 
using commercially available transfection reagents Superfect* (Quiagen), Dosper* or Fugene* (Boehringer 
25 Mannheim). The cells are grown as described in Lucas et al., supra. Approximately 3 x 10 7 cells are frozen 
in an ampule for further growth and production as described below. 

The ampules containing the plasmid DNA are thawed by placement into water bath and mixed by 
vortexing. The contents are pipetted into a centrifuge tube containing 10 mLs of media and centrifuged at 1000 
rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2 
30 fim filtered PS20 with 5% 0.2 ^m diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL 
spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner 
filled with 150 mL selective growth medium and incubated at 37°C. After another 2-3 days, 250 mL, 500 mL 
and 2000 mL spinners are seeded with 3 x 10 s cells/mL. The cell media is exchanged with fresh media by 
centrifugation and resuspension in production medium. Although any suitable CHO media may be employed, 
35 a production medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may actually be used. A 
3L production spinner is seeded at 1.2 x 10 6 cells/mL. On day 0, the cell number pH ie determined. On day 
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1. the spinner is sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled, the 
temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam (e.g., 35% 
polydimethylsiloxane emulsion, Dow Corning 365 Medical Grade Emulsion) taken. Throughout the production, 
the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability dropped below 
70%, the cell culture is harvested by centrifugation and filtering through a 0.22 /un filter. The filtrate was 
5 either stored at 4°C or immediately loaded onto columns for purification. 

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before 
purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media 
is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl 
and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4°C. After loading, the column is washed with additional 

10 equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly 
purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes, 0.14 M NaCl and 4% 
mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C. 

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The 
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 

15 mM Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer 
before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 
1 ml fractions into tubes containing 275 /iL of 1 M Tris buffer, pH 9. The highly purified protein is 
subsequently desalted into storage buffer as described above for the poly-His tagged proteins. The homogeneity 
is assessed by SDS polyacrylamide gels and by N-terminal amino acid sequencing by Edman degradation. 

20 Certain of die TAT polypeptides disclosed herein have been successfully expressed and purified using 

this technique(s). 

EXAMPLE 5: Expression of TAT in Yeast 

The following method describes recombinant expression of TAT in yeast. 

25 First ' y east expression vectors are constructed for intracellular production or secretion of TAT from 

the ADH2/GAPDH promoter. DNA encoding TAT and the promoter is inserted into suitable restriction enzyme 
sites in the selected plasmid to direct intracellular expression of TAT. For secretion, DNA encoding TAT can 
be cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native TAT 
signal peptide or other mammalian signal peptide, or, for example, a yeast alpha-factor or invertase secretory 

30 signal/leader sequence, and linker sequences (if needed) for expression of TAT. 

Yeast cells, such as yeast strain AB1 10, can then be transformed with the expression plasmids described 
above and cultured in selected fermentation media. The transformed yeast supernatants can be analyzed by 
precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with 
Coomassie Blue stain. 

35 Recombinant TAT can subsequently be isolated and purified by removing the yeast cells from the 

fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The 
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concentrate containing TAT may further be purified using selected column chromatography resins. 

Certain of the TAT polypeptides disclosed herein have been successfully expressed and purified using 
this technique(s). 

EXAMPLE 6: Expression o f TAT in Baculovirus-Infected Insect Cells 

The following method describes recombinant expression of TAT in Baculovirus-infected insect cells. 

The sequence coding for TAT is fused upstream of an epitope tag contained within a baculovirus 
expression vector. Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of IgG). 
A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such 
as pVL1393 (Novagen). Briefly, the sequence encoding TAT or the desired portion of the coding sequence of 
TAT such as me sequence encoding an extracellular domain of a transmembrane protein or the sequence 
encoding the mature protein if the protein is extracellular is amplified by PCR with primers complementary to 
the 5' and 3' regions. The 5' primer may incorporate flanking (selected) restriction enzyme sites. The product 
is then digested with those selected restriction enzymes and subcloned into the expression vector. 

Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold™ virus 
DNA (Pharmingen) into Spodopterafrugiperda ("Sf9") cells (ATCC CRL 171 1) using lipofectin (commercially 
available from GIBCO-BRL). After 4 - 5 days of incubation at 28°C, the released viruses are harvested and 
used for further amplifications. Viral infection and protein expression are performed as described by O'Reilley 
et Baculovirus expression vectors: A Lahorat n rv Manual Oxford: Oxford University Press (1994). 

Expressed poly-his tagged TAT can then be purified, for example, by Ni 2+ -chelate affinity 
chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by 
Rupert etal., Nature, 362:175-179(1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25 
mL Hepes, pH 7.9; 12.5 mM MgCl,; 0. 1 mM EDTA; 10% glycerol; 0. 1 % NP-40; 0.4 M KC1), and sonicated 
twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted 50-fold 
in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 M m 
filter. A Ni 2+ -NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of 
5 mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract is 
loaded onto the column at 0.5 mL per minute. The column is washed to baseline A 280 with loading buffer, at 
which point fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM 
phosphate; 300 mM NaCl, 10% glycerol, pH 6.0), which elutes nonspecifically bound protein. After reaching 
A 280 baseline again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary wash 
buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining or Western blot with 
Ni 2+ -NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted His 0 -tagged TAT are 
pooled and dialyzed against loading buffer. 

Alternatively, purification of the IgG tagged (or Fc tagged) TAT can be performed using known 
chromatography techniques, including for instance, Protein A or protein G column chromatography. 

Certain of the TAT polypeptides disclosed herein have been successfully expressed and purified using 
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this technique(s). 



EXAMPLE 7: Preparation of Antibodies that Bind TAT 

This example illustrates preparation of monoclonal antibodies which can specifically bind TAT. 

Techniques for producing the monoclonal antibodies are known in the art and are described, for 
instance, in Goding, supra. Immunogens that may be employed include purified TAT, fusion proteins 
containing TAT, and cells expressing recombinant TAT on the cell surface. Selection of the immunogen can 
be made by the skilled artisan without undue experimentation. 

Mice, such as Balb/c, are immunized with the TAT immunogen emulsified in complete Freund's 
adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. Alternatively, 
the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and 
injected into the animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with 
additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also 
be boosted with additional immunization injections. Serum samples may be periodically obtained from the mice 
by retro-orbital bleeding for testing in ELISA assays to detect anti-TAT antibodies. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected 
with a final intravenous injection of TAT. Three to four days later, the mice are sacrificed and the spleen cells 
are harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected murine myeloma 
cell line such as P3X63AgU.l, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells 
which can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine, aminopterin, and 
thymidine) medium to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an ELISA for reactivity against TAT. Determination of 
"positive" hybridoma ceDs secreting the desired monoclonal antibodies against TAT is within the skill in the art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce 
ascites containing the anti-TAT monoclonal antibodies. Alternatively, the hybridoma cells can be grown in 
tissue culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be 
accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, 
affinity chromatography based upon binding of antibody to protein A or protein G can be employed. 

EXAMPLE 8: Purification of TAT Polypeptides Using Specific Antibodies 

Native or recombinant TAT polypeptides may be purified by a variety of standard techniques in the 
art of protein purification. For example, pro-TAT polypeptide, mature TAT polypeptide, or pre-TAT 
polypeptide is purified by immunoaffinity chromatography using antibodies specific for the TAT polypeptide 
of interest. In general, an immunoaffinity column is constructed by covalently coupling the anti-TAT 
polypeptide antibody to an activated chromatographic resin. 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium 
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sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.). 
Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or 
chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a 
chromatographic resin such as CnBr-activated SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody 
is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's 
5 instructions. 

Such an immunoaffinity column is utilized in the purification of TAT polypeptide by preparing a 
fraction from cells containing TAT polypeptide in a soluble form. This preparation is derived by solubilization 
of the whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent 
or by other methods well known in the art. Alternatively, soluble TAT polypeptide containing a signal sequence 

1 0 may be secreted in useful quantity into the medium in which the cells are grown. 

A soluble TAT polypeptide-containing preparation is passed over the immunoaffinity column, and the 
column is washed under conditions that allow the preferential absorbance of TAT polypeptide (e.g. , high ionic 
strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt 
antibody/TAT polypeptide binding (e.g. , a low pH buffer such as approximately pH 2-3, or a high concentration 

15 of a chaotrope such as urea or thiocyanate ion), and TAT polypeptide is collected. 

EXAMPLE 9: In Vitro Tumor Cell Killing Assay 

Ma mm alian cells expressing the TAT polypeptide of interest may be obtained using standard expression 
vector and cloning techniques. Alternatively, many tumor cell lines expressing TAT polypeptides of interest 
20 are publicly available, for example, through the ATCC and can be routinely identified using standard ELISA 
or FACS analysis. Anti-TAT polypeptide monoclonal antibodies (and toxin conjugated derivatives thereof) may 
then be employed in assays to determine the ability of the antibody to kill TAT polypeptide expressing cells in 
vitro. 

For example, cells expressing the TAT polypeptide of interest are obtained as described above and 
25 plated into 96 well dishes. In one analysis, the antibody/toxin conjugate (or naked antibody) is included 

throughout the cell incubation for a period of 4 days. In a second independent analysis, the cells are incubated 
for 1 hour with the antibody/toxin conjugate (or naked antibody) and then washed and incubated in the absence 
of antibody/toxin conjugate for a period of 4 days. Cell viability is then measured using the CellTiter-Glo 
Luminescent Cell Viability Assay from Promega (Cat# G7571). Untreated cells serve as a negative control. 

30 

EXAMPLE 10: In Vivo Tumor Cell Killing Assay 

To test the efficacy of conjugated or unconjugated anti-TAT polypeptide monoclonal antibodies, anti- 
TAT antibody is injected intraperitoneally into nude mice 24 hours prior to receiving tumor promoting cells 
subcutaneously in the flank. Antibody injections continue twice per week for the remainder of the study. 
35 Tumor volume is then measured twice per week. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to 
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practice the invention. The present invention is not to be limited in scope by the construct deposited, since the 
deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs 
that are functionally equivalent are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the 
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition 
to those shown and described herein will become apparent to those skilled in the art from the foregoing 
description and fall within the scope of the appended claims. 
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